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ASSAYS, AGENTS, THERAPY AND DIAGNOSIS RELATING TO 
MODUIiATION OF CELLUIAR DNA REPAIR ACTIVITY 

The present invention relates to screening methods, 
peptides, mimetics, and methods of use based on the 
5 surprising discovery and characterisation of an interaction 
between known proteins and the establishment that such 
interaction plays a key role in DNA repair, and thus 
numerous cellular processes of interest in therapeutic 
contexts. Two proteins in question are XRCC4 and DNA ligase 
10 IV. Interaction between XRCC4 and DNA-PKcs/Ku is also 
indicated. 

The invention has arisen on th.e basis of the work of 
the present inventors establishing for the first time 
crucial information about XRCC4 . Some information was 
15 available on the physiological function of this protein, it 
having been implicated in the Ku-associated DNA double- 
strand break repair (KADR) apparatus. However, very little 
was known about its biological activity and what its role in 
the KADR apparatus actually is. Prior to the making of the 

20 present invention it was not feasible to provide assays 
useful as primary screens for inhibitors of XRCC4 . 

Furthermore, the inventors' new cloning work has 
identified a yeast homologue of mammalian DNA ligase IV. No 
physiological function has previously been assigned to 

25 mammalian DNA ligase IV, but the inventors' yeast work, 
including analysis of the effect of knock-out mutation in 
yeast, now establishes the physiological relevance of DNA 
ligase IV and thus provides indication of therapeutic 
contexts in which modulation of its function can be 

30 effected. 

The work disclosed herein establishing interaction 
between XRCC4 and DNA ligase IV, interaction between XRCC4 
and . DNA-PKcs/Ku, and also a biological role for such 
interactions, now gives rise to screening methods for 
35 identifying compounds which affect the interaction. 
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particularly those which interfere with it, and which may 
affect or modulate particular aspects of cellular DNA repair 
activity, useful in a therapeutic context, for example in 
the treatment of proliferative disorders, cancers and 
5 tumours, disorders involving retroviruses such as AIDS, 
human adult T-cell leukaemia /lymphoma. Type I diabetes and 
multiple sclerosis, and also in radiotherapy. Furthermore 
it gives rise to the rational design of peptides or mimetics 
or functional analogues which fulfil this function. 

10 

One of the most dangerous forms of damage that can 
befall a cell is the DNA double-strand break (DSB) , which is 
the principal lethal lesion induced by ionising radiation 
and by radiomimetic agents. Consequently, cells have 

15 evolved highly effective and complex systems for recognising 
this type of DNA damage and ensuring that it is repaired 
efficiently and accurately. Two major pathways have evolved 
to repair DNA DSBs in eukaryotes, homologous recombination 
and DNA non-homologous end- joining (NHEJ) , 

20 Much of what is currently known about DNA NHEJ in 

mammalian systems has been obtained through studies of a 
series of mutant rodent cell lines that were identified 
originally as being hypersensitive towards ionising 
radiation and which display severe defects in DNA DSB repair 

25 {reviewed in Jeggo et ai,, 1995; Roth et ai., 1995). 

Characterisation of these cell lines has revealed that they 
fall into three complementation groups, termed IR4, IR5 and 
IR7 . The hamster cell line XR-1 defines IR4, IRS consists 
of a number of independently isolated hamster cell mutants, 

30 and IR7 contains the hamster cell line V3 and cells derived 
from the severe combined immune-deficient (scid) mouse. 
Various studies have shown that IR4, IRS and IRS cells are 
defective in antibody and T-cell receptor V(D)J 
recombination . 

35 Considerable effort has been directed towards 
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establishing the nature of the gene-products defective in 
cells of IR4, IRS and IR7, and determining how they function 
in DNA NHEJ. As a result of such studies, it was shown that 
cells of IRS and IR7 are deficient in components of the DNA- 
S dependent protein kinase (DNA-PK) (Ku80 and DNA-PKcs, 
respectively) . DNA-PK is a nuclear protein Ser/Thr kinase 
that displays the unusual property of being activated upon 
binding to DNA DSBs or other perturbations of the DNA 
double-helix (Jackson, 1997) . In light of the biochemical 

10 properties of DNA-PK which have been established, an 

attractive hypothesis is that this enzyme serves as a DNA 
damage sensor in vivo. 

In contrast to cells of IRS and IR7, XR-1 cells of IR4 
are not deficient in a DNA-PK component, as evidenced by the 

15 fact that extracts of these cells have normal DNA end- 
binding activity (Getts and Stamato, 1994; Rathmell and Chu, 
1994; Finnie et al,, 1996) and DNA-PK activity {Blunt et 
ai., 1995), and that expression of neither Ku80 nor DNA-PKcs 
complements the V{D)J recombination or radiosensitivity 

20 defects of XR-1 cells {Taccioli et al,, 1994; Blunt et ai., 
1995) . Instead, it has been shown that DNA from human 
chromosome region 5ql3-14 complements XR-1 cells, the 
complementing gene being termed XRCC4 (Otevrel and Stamato, 
1995) . 

25 Furthermore, (Li et ai., 1995) have identified the 

XRCC4 gene recently through its ability to confer normal 
V(D)J recombination activity and partially restore the DSB 
repair defect on XR-1 cells, and have demonstrated that the 
XRCC4 locus is deleted in XR-1 cells. 

30 Interestingly, XRCC4 encodes a small 334 amino acid 

residue protein of calculated molecular weight of 38 kOa, 
and the human and mouse homologues of this protein have been 
shown to be approximately 75% identical (Li et ai., 1995). 
Perhaps surprisingly, however, sequence analyses reveal that 

35 XRCC4 is not significantly related to any previously- 
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characterised proteins. Therefore, although it is clear 
that XRCC4 plays a crucial role in DNA DSB repair and V(D)J 
recombination, the cloning and sequencing of the cDNA for 
this factor has so far provided little clue to its mechanism 
5 of action. 

The Li et al. paper is the only paper published on the 
XRCC4 protein as such prior to the priority date of the 
present invention. It reports that XRCC4 is not related to 
any other proteins and so its sequence gives no clear clues 

10 as to its function. Prior to the present work, therefore, 
the only assays available for XRCC4 were cellular 
radiosensitivity and cellular V{D)J recombination - assays 
that cannot be used as primary screens for inhibitors. 
Consequently, it was impossible to conceive of any 

15 biochemical screen for the activity of this factor. 

It should be noted too that the Li et al. paper does 
not provide any evidence that XRCC4 is a nuclear protein 
(shown herein) and discusses on page 1084 that XRCC4 has 
putative sites for cytoplasmic protein tyrosine kinases. 

20 Thus, it is clear that there really was nothing known about 
how this protein might act. 

The present inventors have shown that XRCC4 exists, at 
least in part, in the cell nucleus and demonstrated 
convincingly that it interacts with DNA ligase IV, and also 

25 DNA-PKcs/Ku. Evidence is provided herein in the 

experimental section, with confirmation being provided also 
by Mizuta et ai., 1997. Grawunder et al, 1997 has also 
provided evidence of interaction between XRCC4 and DNA 
ligase IV. See also the inventors' publications Teo and 

30 Jackson, 1997 and Critchlow et al, 1997. 

DNA ligases are catalysts which join together Okazaki 
fragments during lagging strand DNA synthesis, complete 
exchange events between homologous duplex DNA molecules, and 
35 seal single- or double-strand breaks in the DNA that are 
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produced either by the direct action of a DNA damaging 
agents or by DNA repair enzymes removing DNA lesions (for 
review, see Lindahl and Barnes, 1992) . In contrast to 
prokaryotic and yeast systems, where only a single species 
5 of DNA ligase has been previously been described (Johnston 
and Nasmyth, 1978), four biochemically distinct DNA ligases 
have been identified in mammalian cells (Tomkinson et ai., 
1991; Wei et <ai., 1995; Robins and Lindahl, 1996). In vitro 
assays, and studies of yeast and human cells containing 

10 mutated alleles of DNA ligase I suggest that this enzyme 

joins Okazaki fragments during DNA replication (Henderson et 
al., 1985; Malkas et al., 1990; Tomkinson et ai., 1991; 
Barnes et ai., 1992; Li et ai . , 1994; Prigent et ai., 1994; 
Waga et al., 1994). Furthermore, the sensitivity of DNA 

15 ligase I mutant cells to ultraviolet (UV) irradiation and 
some DNA damaging agents suggests that DNA ligase I is 
involved in nucleotide excision repair and base excision 
repair (Henderson et ai., 1985; Lehmann et al., 1988; Malkas 
et ai., 1990; Tomkinson et aJ., 1991; Barnes et ai., 1992; 

20 Li et ai., 1994; Prigent et ai., 1994; Waga et ai., 1994), 
Much less, however, is known about the function of the 
other three mammalian DNA ligases. It is currently unclear 
whether DNA ligase II and III arise from separate genes or 
by alternative splicing of the same gene (Roberts et al., 

25 1994; Wang et ai., 1994; Husain et ai., 1995). However, 
ligase II is induced in response to alkylation damage 
(Creissen and Shall, 1982), suggesting a role in DNA repair. 
Similarly, the elevation of a splice variant of ligase III 
(ligase III-P) levels in spermatocytes undergoing meiotic 

30 recombination (Chen et ai., 1995; Husain et ai . , 1995; 

Mackey et ai., 1997) and the association of another splice 
variant (ligase Ill-a) with the DNA repair protein XRCCl 
(Caldecott et ai., 1994; Thompson et ai., 1990) are 
consistent with this enzyme joining DNA strand breaks to 
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complete DNA recombination and repair (Jessberger et ai,, 
1993) . Indeed, DNA ligase III, when present in a complex 
with XRCC-1, can reconstitute the ligation event necessary 
to complete base excision repair in vitro (Kubota et al., 
5 1996) . 

A fourth enzyme, DNA ligase IV, has been purified 
recently from human cells and has distinct biochemical 
properties from other ligases {Robins and Lindahl, 1996) . 
The physiological function of mammalian ligase IV is, 

10 however, unknown. 

In most prokaryotes there is only one DNA ligase, and 
this enzyme catalyses all the DNA- joining events during 
replication, recombination and repair {Lindahl and Barnes, 
1992) . Similarly, genetic and biochemical data have 

15 suggested that there is only one DNA ligase in Saccharomyces 
cerevisiae (Lindahl and Barnes, 1992) , although 
fractionation of yeast cell extracts has given an indication 
of a second DNA ligase activity (Tomkinson et al., 1992). 
The present inventors searched for DNA ligase 

20 homologues in the S. cerevisiae genome, which was completely 
sequenced recently (Goffeau et al., 1996; Oliver, 1996). 
These searches identified a hitherto uncharacterised open 
reading frame (ORF) with sequence similarity along its 
entire length to mammalian DNA ligase IV. The experimental 

25 section below describes the effects of disrupting this gene, 
which the inventors have termed LIG4, on DNA replication, 
homologous recombination, and DNA repair in response to a 
variety of DNA-damaging agents. These studies show that 
LIG4 plays a crucial role in DNA double-strand break repair 

30 via the non-homologous end-joining {NHEJ) pathway but does 
not have an essential role in other DNA repair pathways 
studied. 

Furthermore, it is shown that LIG4 functions in the 
same DNA repair pathway that utilises the DNA end-binding 
35 protein Ku. However, the phenotype of lig4 mutant yeasts is 
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not identical to those of yeasts disrupted for Ku function, 
revealing that Ku has additional roles in genome 
maintenance . 



5 In summary, XRCC4 was known to be involved somehow in 

Ku-associated DNA double-strand break repair (FCADR) , but its 
biological activity was obscure. The present inventors have 
established for the first time biological activity of XRCC4, 
that is binding to DNA ligase IV. Furthermore, the 

10 physiological relevance of DNA ligase IV was not known. The 
inventors have now established that DNA ligase IV is 
important for double-strand DNA break repair via non- 
homologous end joining (NHEJ) - by unexpectedly identifying 
and cloning, then mutating, a yeast homologue gene and by 

15 establishing strong interaction between XRCC4 and DNA ligase 
IV. 

The inventors have also established that XRCC4 
interacts with DNA-PKcs/Ku, and shown that DNA-PKcs is able 
to phosphorylate XRCC4 . 

20 Based on this and other work described below, the 

present invention in various aspects provides for modulation 
of interaction between XRCC4 and DNA ligase IV. 

Various aspect the present invention provide for the 
use of XRCC4 and DNA ligase IV in screening methods and 

25 assays for agents which modulate interaction between XRCC4 
and DNA ligase IV. 

Further aspects provide for modulation of interaction 
between XRCC4 and DNA-PKcs/Ku and use of these molecules in 

30 screening methods and assays for agents which modulate 

interaction between XRCC4 and DNA-PKcs/Ku. For simplicity, 
much of the present disclosure refers to XRCC4 and DNA 
ligase IV. However, unless the context requires otherwise, 
every such reference should be taken to be equally 

35 applicable to the interaction between XRCC4 and DNA-PKcs/Ku. 
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Methods of obtaining agents able to modulate 
interaction between XRCC4 and DNA ligase IV (or, it must be 
remembered, XRCC4 and DNA-PKcs/Ku) include methods wherein a 
suitable end-point is used to assess interaction in the 
5 presence and absence of a test substance. Detailed 
disclosure in this respect is included below. It is worth 
noting, however, that combinatorial library technology 
provides an efficient way of testing a potentially vast 
number of different substances for ability to modulate bind 
10 to and/or activity of a polypeptide. Such libraries and 
their use are known in the art, for all manner of natural 
products, small molecules and peptides, among others. The 
use of peptide libraries may be preferred in certain 
circumstances . 

15 

Appropriate agents may be obtained, designed and used 
for any of a variety of purposes. 

One is anti-tumour or anti-cancer therapy, particularly 
augmentation of radiotherapy or chemotherapy. Ionising 

0 radiation and radiomimetic drugs are commonly used to treat 
cancer by inflicting DNA damage. Cells deficient in DNA 
repair, particularly the KADR pathway, are hypersensitive to 
ionising radiation and radiomimetics . Evidence provided 
herein shows the KADR pathway involves XRCC4 and DNA ligase 

5 IV, indicating that inhibition of their function, e.g. by 
inhibiting their interaction, will have an effect on the 
KADR pathway, DNA repair and cellular sensitivity to 
ionising radiation and radiomimetics. 

Another is the potentiation of gene targeting and gene 

0 therapy. Inhibition of KADR may be used to increase 

efficiencies of gene targeting, of interest and ultimate use 
in gene therapy. Two ways exist for repairing DNA double- 
stranded breaks (DSBs) . One is through the process of 
illegitimate recombination (also known as DNA non-homologous 

5 end-joining or NHEJ) and this is catalysed by the PCADR 
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system now known to involve XRCC4 and DNA ligase IV. The 
other system is the process of homologous recombination, 
whereby the damaged DNA molecule exchanges information with 
an undamaged homologous partner DNA molecule. In mammalian 
5 cells, the illegitimate pathway tends to predominate. 
Inhibiting the KADR system will make the proportion of DSBs 
repaired by homologous recombination increase. Thus, anti- 
KADR factor agents, including those provided in accordance 
with the present invention, will have this effect. 
10 Homologous gene targeting is used in making knock-out mice 
and other transgenic animals but it is not very efficient, 
so increasing this efficiency in accordance with the present 
invention will be highly beneficial. Ultimately, gene 
therapist wish to precisely replace the mutated gene with a 
15 functional one. At present just to get the functional gene 
to integrate anywhere in the genome is the priority, but the 
long-term aim is for integration at the right site. KADR 
(e.g. XRCC4 and/or ligase IV, or XRCC4 and/or DNA-PKcs/Ku) 
inhibitors therefore have a great therapeutic potential in 
20 such context. 

A further, related, purpose is in anti-retroviral 
therapy, since DNA repair pathways such as involving KADR 
and the components XRCC4 and DNA ligase IV are involved in 
effecting retroviral and retrotransposon integration into 
25 the genome of a host cell. Retroviruses are of considerable 
risk to the health of humans and animals, causing, int^r 
alia, AIDS, various cancers and human adult T-cell 
leukaemi a/ lymphoma . Integration of retroviral DNA into the 
genome is essential for efficient viral propagation and may 
30 be targeted by inhibition of DNA repair pathway components. 
Additionally, modulators of KADR components such as 
XRCC4 and DNA ligase IV, DNA-PKcs/Ku, may be used in 
modulation of immune system function, since such factors are 
required for generation of mature immunoglobulin and T-cell 
35 receptor genes by site-specific V(D)J recombination. 
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Compounds which stabilise the interaction between two 
components, such as XRCC4 and DNA ligase IV, or XRCC4 and 
DNA-PKcs/Ku, and which may up-regulate activity, may be 
screened for using assays in which conditions are too harsh 
5 for the relevant interaction. Agents which stabilise the 
interaction may be identified. One alternative is to screen 
for substances that enhance DMA ligase IV catalytic 
activity, which may be determined as discussed elsewhere. 
An up-regulator of activity may be used to potentiate DNA 

10 repair further, and this may be in normal individuals, with 
possible long-term beneficial effects bearing in mind that 
many of the common manifestations of ageing arise through 
the gradual and inexorable accumulation of mutations in 
somatic cells. Up-regulators may be used in treating 

15 patients who are debilited in the tCADR pathway or other DNA 
repair pathway. 

Interaction between XRCC4 and DNA ligase IV, or XRCC4 
and DNA-Pkcs/Ku may be inhibited by inhibition of the 

20 production of the relevant protein. For instance, 
production of one or more of these components may be 
inhibited by using appropriate nucleic acid to influence 
expression by antisense regulation. The use of anti-sense 
genes or partial gene sequences to down-regulate gene 

25 expression is now well-established. Double-stranded DNA is 
placed under the control of a promoter in a "reverse 
orientation" such that transcription of the "anti-sense" 
strand of the DNA yields RNA which is complementary to 
normal mRNA transcribed from the "sense" strand of the 

30 target gene. The complementary anti-sense RNA sequence is 
thought then to bind with mRNA to form a duplex, inhibiting 
translation of the endogenous mRNA from the target gene into 
protein. Whether or not this is the actual mode of action 
is still uncertain. However, it is established fact that 

35 the technique works. 
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Another possibility is that nucleic acid is used which 
on transcription produces a ribozyme, able to cut nucleic 
acid at a specific site - thus also useful in influencing 
gene expression. Background references for ribozymes 
5 include Kashani-Sabet and Scanlon, 1995, Cancer Gene 
Therapy, 2(3): 213-223, and Mercola and Cohen, 1995, Cancer 
Gene Therapy, 2(1), 4 7-59. 

Thus, various methods and uses of modulators, 
10 particularly inhibitors, of XRCC4 and DNA ligase IV, or 
XRCC4 and DNA-PKcs/Ku, interaction and/or activity are 
provided as further aspects of the .present invention. The 
purpose of disruption, interference with or modulation of 
interaction between XRCC4 and DNA ligase IV, and/or XRCC4 
15 and DNA-PKcs/Ku, may be to modulate any activity mediated by 
virtue of such interaction, as discussed above and further 
below . 

Brief Description of the Figures 
:0 Figure 1 illustrates co-purification of XRCC4 and DNA 

ligase IV from HeLa cells. 

Figure lA shows the results of quantitative Western 

immunoblot analyses for DNA ligase IV (diamonds) and XRCC4 

(squares) (percent of protein) for various fractions at each 
5 chromatographic stage of gel chromatography filtration for 

DNA ligase IV purification from HeLa cells. 

Figure IB shows results of quantitative Western 

immunoblot analyses for DNA ligase IV (diamonds) , XRCC4 

(squares) and DNA ligase III (circles) (percent of protein) 
0 for various fractions at each stage of Mono S column 

fractionation on the fraction marked with an asterisk in 

Figure lA. 

. Figure 2 shows that YOR005c encodes a homologue of 
mammalian DNA ligase IV, indicating amino acid sequence 
5 similarities between 5. cerevisiae Lig4p (scLIG4; the 
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product of the YOROOSc ORF) and human DNA ligase IV 
(hLIGIV) . The alignment was generated using the PILEUP 
programme on the GCG (Genetics Computer Group, Wisconsin) 
package, and identical and similar amino acid residues are 
5 indicated by reverse shading and grey shading, respectively, 
using the BOXSHADE programme. Amino acid residues are 
numbered from the amino teirmini of the full-length 
polypeptides. Gaps were introduced for maximum alignment. 
The active site lysine residue is indicated with an 
10 arrowhead. The "core" conserved region of DNA ligases of 
eukaryotes and eukaryotic viruses is delineated with a bar. 

Figure 3 shows that LIG4 funct.ions in the Ku-dependent 
pathway for repairing ionising radiation-induced DNA damage. 
The sensitivity of various yeast strains to killing by 
15 ionising radiation was judged by exposure to various 

radiation doses. Error bars are not shown for simplicity; 
standard deviation is < 5 % of each value point. 

Figure 3A shows % cell survival for wild-type, llg4 
mutants and rad52 mutants at various doses in kRads of 
20 ionising radiation. 

Figure 3B shows % cell survival for lig4/ ykulO mutants, 
rad52 mutants rad52/lig4 mutants, rad52/yku70 mutants and 
rad52/lig4/yku70 mutants at various doses in kRads of 
ionising radiation. 
25 Figure 4 shows that disruption of LIG4 results in a 

dramatic reduction in the ability to repair restriction 
enzyme generated cohesive DNA DSBs in plasmid (pBTM116) DNA, 
plotting transformant recovery relative to uncut control 
plasmid where various yeast strains, wild-type and mutant, 
30 were transformed with pBTM116 digested with EcoRl (left 
panel) or PstI (right panel) . 

Figure 5 shows a model in which XRCC4 serves as a 
molecular bridge to target DNA ligase IV to a DNA DSB. 

Figure 6 shows the amino acid sequence and encoding 
35 nucleotide sequence for S. cerevisxae LIG4 , provided in 
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accordance with aspects of the present invention. 
Translation begins at the start site indicated by the arrow. 

Amino acid and nucleic acid sequences of polypeptides 
5 useful in various aspects of the present invention are 
available from GenBank under the following accession 
numbers: human Ku70 - J04611; human Ku80 - M30938; S. 
cerevlslae Ku70 - X70379; S. cerevisiae Ku80 - Z49702; human 
ligase IV - X83441; S. cerevlsiae ligase IV - YOROOSc on the 
10 right arm of S. cerevislae chromosome XV, accession number 
Z74913; human XRCC4 - U40622 (334 amino acid residue open 
reading frame); human DNA-Pkcs - U4.7077 (Hartley et al, 
originally provided the sequence, though lacking an intron. 
Poltoratsky et al. provided a partial sequence including the 
15 intron not included in the Hartley et al. sequence. The 
sequence available from GenBank is complete.). 

All documents and GenBank sequences mentioned anywhere 
in this specification are incorporated by reference. 

20 

The present invention in various aspects provides for 
modulating, interfering with or interrupting interacion 
between the XRCC4 protein and DNA ligase IV, using an 
appropriate agent. The present invention also provides in 
25 analogous aspects for modulating, interfering with or 

interrupting interaction between the XRCC4 protein and DNA- 
Pkcs/Ku, using an appropriate agent. 

Such an agent capable of modulating interaction between 
30 XRCC4 and DNA ligase IV may be capable of blocking binding 
between a site located within amino acid residues 550-884 of 
human DNA ligase IV, which may be at one or other or both of 
the BRCT domains (discussed further below) , or between these 
domains, and a site on human XRCC4 . The site on DNA ligase 
35 IV may be between amino acid residues 591-676, between amino 
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acid residues 728-844 or between residues 611-121. The 
full amino acid sequence of the XRCC4 protein has been 
elucidated and is set out in Li et ai . {Cell (1995) 83, 
1079-1089) which is incorporated herein by reference, and of 
5 which the amino acid residue numbering is used along with 
the encoding nucleic acid sequence. The GenBank reference 
is indicated above. The DNA ligase IV amino acid and 
nucleotide coding sequences are given in Wei et al., of 
which the amino acid residue numbering is used, with the 
0 yeast LIG4 sequences being shown in Figure 6. The GenBank 
references are given above. 

Note, that recently Wilson, T.-E. 1997, Nature 388: 495- 
4 98 has suggested that the initiating methionine for human 
DNA ligase IV is upstream from that indicated by Wei et ai., 
5 1995, whose amino acid sequence numbering is used herein. 
See the Wilson paper for details. The present inventors 
have preliminary data which disagrees with that of Wilson. 
However, should Wilson turn out to be correct this would 
have no bearing on any aspect of the present invention. The 
0 presence or absence of additional amino acids at the N- 

terminus of DNA ligase IV is unlikely to have any effect on 
its interaction with XRCC4 , and the fact remains that the 
present inventors' work shows interaction of XRCC4 with the 
DNA ligase IV which occurs in human cells. 

5 

Agents may be identified by screening techniques which 
involve determining whether an agent under test inhibits or 
disrupts the binding of DNA ligase IV protein or a suitable 
fragment thereof {e.g. including amino acid residues 550- 
0 884, residues 591-676, residues 728-844 or residues 677-727, 
or a smaller fragment of any of these regions) of human DNA 
ligase IV, with XRCC4 or a fragment thereof, or a suitable 
analogue, fragment or variant thereof. 

Suitable fragments of XRCC4 or DNA ligase IV include 
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those which include residues which interact with the 
counterpart protein. Smaller fragments, and analogues and 
variants of this fragment may similarly be employed, e.g. as 
identified using techniques such as deletion analysis or 
5 alanine scanning. 

Thus, the present invention provides a peptide fragment 
of XRCC4 which is able to bind DNA ligase IV and/or inhibit 
interaction between XRCC4 and DNA ligase IV, and provides a 
peptide fragment of DNA ligase IV which is able to bind DNA 
10 ligase IV and/or inhibit interaction between DNA ligase IV 
and XRCC4, such peptide fragments being obtainable by means 
of deletion analysis and/or alanine scanning of the relevant 
protein - making an appropriate mutation in sequence, 
bringing together a mutated fragment of one of the proteins 
15 with the other or a fragment thereof and determining 
interaction. In preferred embodiments, the peptide is 
short, as discussed below, and may be a minimal portion that 
is able to interact with the relevant counterpart protein 
and/or inhibit the relevant interaction. 
20 Of course, similar considerations apply to XRRC4 and 

DNA-PKcs/Ku interacting portions. 

Screening methods and assays are discussed in further 
detail below . 

25 

One class of agents that can be used to disrupt the 
binding of XRCC4 and DNA ligase IV are peptides based on the 
sequence motifs of XRCC4 or DNA ligase IV that interact with 
counterpart DNA ligase IV or XRCC4 . Such peptides tend to 

30 be short, and may be about 4 0 amino acids in length or less, 
preferably about 35 amino acids in length or less, more 
preferably about 30 amino acids in length, or less, more 
preferably about 25 amino acids or less, more preferably 
about 20 amino acids or less, more preferably about 15 amino 

35 acids or less, more preferably about 10 amino acids or less, 
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or 9, 8, 1, 6, 5 or less in length. The present invention 
also encompasses peptides which are sequence variants or 
derivatives of a wild type XRCC4 or DNA ligase IV sequence, 
but which retain ability to interact with DNA ligase IV or 
5 XRCC4 (respectively, as the case may be) and/or ability to 
modulate interaction between XRCC4 and DNA ligase IV. 

Instead of using a wild-type XRCC4 or DNA ligase IV 
fragment, a peptide or polypeptide may include an amino acid 
sequence which differs by one or more amino acid residues 
10 from the wild-type amino acid sequence, by one or more of 
addition, insertion, deletion and substitution of one or 
more amino acids. Thus, variants, .derivatives, alleles, 
mutants and homologues, e.g. from other organisms, are 
included. 

15 Preferably, the amino acid sequence shares homology 

with a fragment of the relevant XRCC4 or DNA ligase 
IV fragment sequence shown preferably at least about 30%, or 
40%, or 50%, or 60%, or 70%, or 75%, or 80%, or 85% 
homology, or at least about 90% or 95% homology. Thus, a 

20 peptide fragment of XRCC4 or DNA ligase IV may include 1, 2, 
3, 4, 5, greater than 5, or greater than 10 amino acid 
alterations such as substitutions with respect to the wild- 
type sequence. 

A derivative of a peptide for which the specific 

25 sequence is disclosed herein may be in certain embodiments 
the same length or shorter than the specific peptide. In 
other embodiments the peptide sequence or a variant thereof 
may be included in a larger peptide, as discussed above, 
which may or may not include an additional portion of XRCC4 

30 or DNA ligase IV. 1, 2, 3, 4 or 5 or more additional amino 
acids, adjacent to the relevant specific peptide fragment in 
XRCC4 or DNA ligase IV, or heterologous thereto may be 
included at one end or both ends of the peptide. 

(It should not be forgotten that references to XRCC4 

35 and DNA ligase IV apply equally to XRCC4 and DNA-Pkcs/Ku . ) 
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As is well-understood, homology at the amino acid level 
is generally in terms of amino acid similarity or identity. 
Similarity allows for "conservative variation", i.e. 
substitution of one hydrophobic residue such as isoleucine, 
5 valine, leucine or methionine for another, or the 
substitution of one polar residue for another, such as 
arginine for lysine, glutamic for aspartic acid, or 
glutamine for asparagine. Similarity may be as defined and 
determined by the TBLASTN program, of Altschul et al. (1990) 
10 J. Mol . Biol, 215: 403-10, which is in standard use in the 
art. Homology may be over the full-length of the relevant 
peptide or over a contiguous sequence of about 5, 10, 15, 
20, 25, 30, 35, 50, 75, 100 or more amino acids, compared 
with the relevant wild-type amino acid sequence. 
15 As noted, variant peptide sequences and peptide and 

non-peptide analogues and mimetics may be employed, as 
discussed further below. 

Various aspects of the present invention provide a 
substance, which may be a single molecule or a composition 
20 including two or more components, which includes a peptide 
fragment of XRCC4 or DNA ligase IV which includes a sequence 
as included in the relevant GenBank entry, a peptide 
consisting essentially of such a sequence, a peptide 
including a variant, derivative or analogue sequence, or a 
25 non-peptide analogue or mimetic which has the ability to 
bind XRCC4 or DNA ligase IV and/or modulate, disrupt or 
interfere with interaction between XRCC4 or DNA ligase IV. 

Variants include peptides in which individual amino 
acids can be substituted by other amino acids which are 
30 closely related as is understood in the art and indicated 
above . 

Non-peptide mimetics of peptides are discussed further 
below . 



35 



As noted, a peptide according to the present invention 
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and for use in various aspects of the present invention may 
include or consist essentially of a fragment of XRCC4 or DNA 
ligase IV as disclosed, such as a fragment whose sequence is 
included in the relevant GenBank entry. Where one or more 
5 additional amino acids are included, such amino acids may be 
from XRCC4 or DNA ligase IV or may be heterologous or 
foreign to XRCC4 or DNA ligase IV. A peptide may also be 
included within a larger fusion protein, particularly where 
the peptide is fused to a non-XRCC4 or DNA ligase IV (i.e. 

10 heterologous or foreign) sequence, such as a polypeptide or 
protein domain. 

The invention also includes derivatives of the 
peptides, including the peptide linked to a coupling 
partner, e.g. an effector molecule, a label, a drug, a toxin 

15 and/or a carrier or transport molecule, and/or a targeting 
molecule such as an antibody or binding fragment thereof or 
other ligand. Techniques for coupling the peptides of the 
invention to both peptidyl and non-peptidyl coupling 
partners are well known in the art. In one embodiment, the 

20 carrier molecule is a 16 aa peptide sequence derived from 
the homeodomain of Antennapedia (e.g. as sold under the name 
"Penetratin" ) , which can be coupled to a peptide via a 
terminal Cys residue. The """Penetratin" molecule and its 
properties are described in WO 91/18981. 

25 Peptides may be generated wholly or partly by chemical 

synthesis. The compounds of the present invention can be 
readily prepared according to well-established, standard 
liquid or, preferably, solid-phase peptide synthesis 
methods, general descriptions of which are broadly available 

30 (see, for example, in J.M. Stewart and J.D. Young, Solid 
Phase Peptide Synthesis, 2nd edition. Pierce Chemical 
Company, Rockford, Illinois (1984), in M. Bodanzsky and A. 
Bodanzsky, The Practice of Peptide Synthesis, Springer 
Verlag, New York (1984); and Applied Biosystems 430A Users 

35 Manual, ABI Inc., Foster City, California), or they may be 
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prepared in solution, by the liquid phase method or by any 
combination of solid-phase, liquid phase and solution 
chemistry, e.g. by first completing the respective peptide 
portion and then, if desired and appropriate, after removal 
5 of any protecting groups being present, by introduction of 
the residue X by reaction of the respective carbonic or 
sulfonic acid or a reactive derivative thereof. 

Another convenient way of producing a peptidyl molecule 

10 according to the present invention (peptide or polypeptide) 
is to express nucleic acid encoding it, by use of nucleic 
acid in an expression system. 

Accordingly the present invention also provides in 
various aspects nucleic acid encoding the polypeptides and 

15 peptides of the invention. 

Generally, nucleic acid according to the present 
invention is provided as an isolate, in isolated and/or 
purified form, or free or substantially free of material 
with which it is naturally associated, such as free or 

20 substantially free of nucleic acid flanking the gene in the 
human genome, except possibly one or more regulatory 
sequence (s) for expression. Nucleic acid may be wholly or 
partially synthetic and may include genomic DNA, cDNA or 
RNA. Where nucleic acid according to the invention includes 

25 RNA, reference to the sequence shown should be construed as 
reference to the RNA equivalent, with U substituted for T. 

Nucleic acid sequences encoding a polypeptide or 
peptide in accordance with the present invention can be 
readily prepared by the skilled person using the information 

30 and references contained herein and techniques known in the 
art (for example, see Sambrook, Fritsch and Maniatis, 
"Molecular Cloning, A Laboratory Manual, Cold Spring Harbor 
Laboratory Press, 1989, and Ausubel et al. Short Protocols 
in Molecular Biology, John Wiley and Sons, 1992), given the 

35 nucleic acid sequence and clones available. These 
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techniques include (i) the use of the polymerase chain 
reaction (PGR) to amplify samples of such nucleic acid, e.g. 
from genomic sources, (ii) chemical synthesis, or (iii) 
preparing cDNA sequences. DNA encoding XRCC4 or DNA ligase 
5 IV fragments may be generated and used in any suitable way 
known to those of skill in the art, including by taking 
encoding DNA, identifying suitable restriction enzyme 
recognition sites either side of the portion to be 
expressed, and cutting out said portion from the DNA. The 
10 portion may then be operably linked to a suitable promoter 
in a standard commercially available expression system. 
Another recombinant approach is to .amplify the relevant 
portion of the DNA with suitable PGR primers. Modifications 
to the XRGG4 or DNA ligase IV sequences can be made, e.g. 
15 using site directed mutagenesis, to lead to the expression 
of modified XRGC4 or DNA ligase IV peptide or to take 
account of codon preference in the host cells used to 
express the nucleic acid. 

20 In order to obtain expression of the nucleic acid 

sequences, the sequences can be incorporated in a vector 
having one or more control sequences operably linked to the 
nucleic acid to control its expression. The vectors may 
include other sequences such as promoters or enhancers to 

25 drive the expression of the inserted nucleic acid, nucleic 
acid sequences so that the polypeptide or peptide is 
produced as a fusion and/or nucleic acid encoding secretion 
signals so that the polypeptide produced in the host cell is 
secreted from the cell. Polypeptide can then be obtained by 

30 transforming the vectors into host cells in which the vector 
is functional, culturing the host cells so that the 
polypeptide is produced and recovering the polypeptide from 
the. host cells or the surrounding medium. Prokaryotic and 
eukaryotic cells are used for this purpose in the art, 

35 including strains of E. coli, yeast, and eukaryotic cells 
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such as COS or CHO cells. 

Thus, the present invention also encompasses a method 
of making a polypeptide or peptide (as disclosed), the 
method including expression from nucleic acid encoding the 
5 polypeptide or peptide (generally nucleic acid according to 
the invention) . This may conveniently be achieved by 
growing a host cell in culture, containing such a vector, 
under appropriate conditions which cause or allow expression 
of the polypeptide. Polypeptides and peptides may also be 
10 expressed in in vitro systems, such as reticulocyte lysate. 

Systems for cloning and expression of a polypeptide in 
a variety of different host cells are well known. Suitable 
host cells include bacteria, eukaryotic cells such as 
mammalian and yeast, and baculovirus systems. Mammalian 
15 cell lines available in the art for expression of a 
heterologous polypeptide include Chinese hamster ovary 
cells, HeLa cells, baby hamster kidney cells, COS cells and 
many others. A common, preferred bacterial host is E. coli . 
Suitable vectors can be chosen or constructed, 
20 containing appropriate regulatory sequences, including 
promoter sequences, terminator fragments, polyadenylation 
sequences, enhancer sequences, marker genes and other 
sequences as appropriate. Vectors may be plasmids, viral 
e.g. 'phage, or phagemid, as appropriate. For further 
25 details see, for example. Molecular Cloning: a Laboratory 
Manual: 2nd edition, Sambrook et al., 1989, Cold Spring 
Harbor Laboratory Press . Many known techniques and 
protocols for manipulation of nucleic acid, for example in 
preparation of nucleic acid constructs, mutagenesis, 
30 sequencing, introduction of DNA into cells and gene 
expression, and analysis of proteins, are described in 
detail in Current Protocols in Molecular Biology, Ausubel et 
al . eds., John Wiley & Sons, 1992. 

Thus, a further aspect of the present invention 
35 provides a host cell containing heterologous nucleic acid as 
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disclosed herein. 

The nucleic acid of the invention may be integrated 
into the genome (e.g. chromosome) of the host cell. 
Integration may be promoted by inclusion of sequences which 
5 promote recombination with the genome, in accordance with 
standard techniques. The nucleic acid may be on an extra- 
chromosomal vector within the cell, or otherwise 
identifiably heterologous or foreign to the cell. 

A still further aspect provides a method which includes 

10 introducing the nucleic acid into a host cell. The 
introduction, which may (particularly for in vitro 
introduction) be generally referred to without limitation as 
^"transformation", may employ any available technique. For 
eukaryotic cells, suitable techniques may include calcium 

15 phosphate transf ection, DEAE-Dextran , electroporation, 
liposome-mediated transfection and transduction using 
retrovirus or other virus, e.g. vaccinia or, for insect 
cells, baculovirus. For bacterial cells, suitable 
techniques may include calcium chloride transformation, 

20 electroporation and transfection using bacteriophage. As an 
alternative, direct injection of the nucleic acid could be 
employed . 

Marker genes such as antibiotic resistance or 
sensitivity genes may be used in identifying clones 
25 containing nucleic acid of interest, as is well known in the 
art . 

The introduction may be followed by causing or allowing 
expression from the nucleic acid, e.g. by culturing host 
cells (which may include cells actually transformed although 

30 more likely the cells will be descendants of the transformed 
cells) under conditions for expression of the gene, so that 
the encoded polypeptide (or peptide) is produced. If the 
polypeptide is expressed coupled to an appropriate signal 
leader peptide it may be secreted from the cell into the 

35 culture mediiam. Following production by expression, a 
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polypeptide or peptide may be isolated and/or purified from 
the host cell and/or culture medium, as the case may be, and 
subsequently used as desired, e.g. in the formulation of a 
composition which may include one or more additional 
5 components, such as a pharmaceutical composition which 
includes one or more pharmaceutically acceptable excipients, 
vehicles or carriers (e.g. see below). 

Introduction of nucleic acid encoding a peptidyl 
10 molecule according to the present invention may take place 
in vivo by way of gene therapy, to disrupt or interfere with 
interaction between XRCC4 or DNA llgase IV 

Thus, a host cell containing nucleic acid according to 
the present invention, e.g. as a result of introduction of 
15 the nucleic acid into the cell or into an ancestor of the 
cell and/or genetic alteration of the sequence endogenous to 
the cell or ancestor (which introduction or alteration may 
take place in vivo or ex vivo), may be comprised (e.g. in 
the soma) within an organism which is an animal, 
20 particularly a mammal, which may be human or non-human, such 
as rabbit, guinea pig, rat, mouse or other rodent, cat, dog, 
pig, sheep, goat, cattle or horse, or which is a bird, such 
as a chicken. Genetically modified or transgenic animals or 
birds comprising such a cell are also provided as further 
25 aspects of the present invention. 

This may have a therapeutic aim. (Gene therapy is 
discussed below.) Also, the presence of a mutant, allele, 
derivative or variant sequence within cells of an organism, 
particularly when in place of a homologous endogenous 
30 sequence, may allow the organism to be used as a model in 
testing and/or studying substances which modulate activity 
of the encoded polypeptide In vitro or are otherwise 
indicated to be of therapeutic potential. Knock-out mice, 
for instance, may be used to test for radiosensitivity . 
35 Conveniently, however, at least preliminary assays for such 
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substances may be carried out in vitro, that is within host 
cells or in cell-free systems. Where an effect of a test 
compound is established on cellsin vitro, those cells or 
cells of the same or similar type may be grafted into an 
5 appropriate host animal for in vivo testing. 

Suitable screening methods are conventional in the art. 
They include techniques such as radioimmunosassay, 
scintillation proximetry assay and ELISA methods. Suitably 
either the XRCC4 protein or fragment or DNA ligase IV or 

10 fragment, or an analogue, derivative, variant or functional 
mimetic thereof, is immobilised whereupon the other is 
applied in the presence of the ageats under test. In a 
scintillation proximetry assay a biotinylated protein 
fragment is bound to streptavidin coated scintillant - 

15 impregnated beads {produced by Amersham) . Binding of 

radiolabelled peptide is then measured by determination of 
radioactivity induced scintillation as the radioactive 
peptide binds to the immobilized fragment. Agents which 
intercept this are thus inhibitors of the interaction. 

20 Further ways and means of screening for agents which 

modulate interaction between XRCC4 and DNA ligase IV are 
discussed below. 

In one general aspect, the present invention provides 
25 an assay method for a substance with ability to modulate, 
e.g. disrupt or interfere with interaction or binding 
between XRCC4 and DNA ligase IV, the method including: 

(a) bringing into contact a substance according to the 
invention including a peptide fragment of XRCC4 or a 
30 derivative, variant or analogue thereof as disclosed, a 
substance including the relevant fragment of DMA ligase IV 
or a variant, derivative or analogue thereof, and a test 
compound, under conditions wherein, in the absence of the 
test compound being an inhibitor of interaction or binding 
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of said substances, said substances interact or bind; and 

(b) determining interaction or binding between said 
substances . 

A test compound which disrupts, reduces, interferes 
5 with or wholly or partially abolishes binding or interaction 
between said substances (e.g. including a XRCC4 fragment and 
including a DNA ligase IV fragment), and which may modulate 
XRCC4 and/or DNA ligase IV activity, may thus be identified. 
Another general aspect of the present invention 
10 provides an assay method for a substance able to bind the 
relevant region of XRCC4 or DNA ligase IV as the case may 
be, the method including: 

(a) bringing into contact a substance which includes a 
peptide fragment of XRCC4 which interacts with DNA ligase IV 

15 as disclosed, or which includes a peptide fragment of DNA 
ligase IV which interacts with XRCC4, or a variant, 
derivative or analogue of such peptide fragment, as 
disclosed, and a test compound; and 

(b) determining binding between said substance and the 
20 test compound. 

A test compound found to bind to the relevant portion 
of XRCC4 may be tested for ability to modulate, e.g. disrupt 
or interfere with, XRCC4 interaction or binding with DNA 
ligase IV and/or ability to affect DNA ligase IV and/or 

25 XRCC4 activity or other activity mediated by XRCC4 or DNA 
ligase IV as discussed already above. 

Similarly, a test compound found to bind the relevant 
portion of DNA ligase IV may be tested for abiliy to 
modulate, e.g. disrupt or interfere with, DNA ligase IV 

30 interaction or binding with XRCC4 and/or ability to affect 
XRCC4 and/or DNA ligase IV activity or other activity 
mediated by DNA ligase IV or XRCC4 as discussed already 
above . 



35 



These aspects apply equally to interaction between DNA- 
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Pkcs/Ku and XRCC4 . Furthermore, since DNA-PKcs 
phosphorylates XRCC4, determining of such phosphorylation 
can be used in an appropriate assay. 

5 A further aspect of the present invention provides an 

assay method including 

(a) bringing into contact a substance which includes 
at least a fragment of DNA-PKcs/Ku which phosphorylates 
XRCC4, a substance which includes at least a fragment of 

10 XRCC4 including a site phosphorylated by DNA-PKcs/Ku, and a 
test compound; and 

(b) determining phosphorylation at said site. 

Of course, any suitable variant or derivative of DNA- 
PKcs/Ku and/or XRCC4 may be employed in such an assay. 

15 Phosphorylation may be determined for example by 

immobilising XRCC4 or a fragment, variant or derivative 
thereof, e.g. on a bead or plate, and detecting 
phosphorylation using an antibody or other binding molecule 
which binds the relevant site of phosphorylation with a 

20 different affinity when the site is phosphorylated from when 
the site is not phosphorylated. Such antibodies may be 
obtained by means of any standard technique as discussed 
elsewhere herein, e.g. using a phosphorylated peptide (such 
as a fragment of XRCC4) . Binding of a binding molecule 

25 which discriminates between the phosphorylated and non- 
phosphorylated form of XRCC4 or relevant fragment, variant 
or derivative thereof may be assessed using any technique 
available to those skilled in the art, which may involve 
determination of the presence of a suitable label, such as 

30 fluorescence. Phosphorylation may be determined by 

immobilisation of XRCC4 or a fragment, variant or derivative 
thereof, on a suitable substrate such as a bead or plate, 
wherein the substrate is impregnated with scintillant, such 
as in a standard scintillation proximetry assay, with 

35 phosphorylation being determined via measurement of the 
incorporation of radioactive phosphate. Rather than 
immobilising XRCC4 , its phosphorylation by DNA-PKcs/Ku may 
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be assayed by means of allowing its radio- or other 
labelling in solution, with a suitable specific binding 
member such as an antibody or DNA ligase IV or an XRCC4- 
binding fragment thereof being used to pull it out for 
5 determination of labelling. Phosphate incorporation into 
XRCC4 or a fragment, variant or derivative thereof, may be 
determined by precipitation with acid, such as 
trichloroacetic acid, and collection of the precipitate on a 
suitable material such as nitrocellulose filter paper, 
10 followed by measurement of incorporation of radiolabeled 

phosphate. SDS-PAGE separation of substrate may be employed 
followed by detection of radiolabel . 

Another general aspect of the present invention 
15 provides an assay method for a substance able to affect DNA 
ligase IV activity, the method including: 

(a) bringing into contact DNA ligase IV and a test 
compound; and 

(b) determining DNA ligase IV activity. 

20 DNA ligase IV activity may be determined in the 

presence and absence of XRCC4 to allow for an effect of a 
test compound on activity to be attributed to an effect on 
interaction between DNA ligase IV and XRCC4 . 

DNA ligase IV activity may be conveniently determined 

25 by means of its adenylation. DNA ligase IV may be incubated 
with radiolabelled ATP (e.g. as described below) or any 
suitable ATP analogue that interacts with DNA ligase IV in 
an analogous manner, so that radiolabel is incorporated into 
the ligase. (The ligase goes through an enzyme-AMP 

30 adenylated intermediate.) Such radiolabel incorporation may 
be detected by various approaches, including for example 
scintillation proximetry assay Thus, radiolabelling of DNA 
ligase IV may be determined in the presence and absence of 
test compound and in the presence and absence of XRCC4 . 

35 Pre-adenylation of DNA ligase IV with radiolabel allows for 
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assaying for discharge of the radiolabel . 

Another activity of DNA ligase IV which may be 
determined is DNA ligase IV-mediated DNA strand joining. 
For instance, two DNA molecules may be provided each of 
5 which includes a site to which a PGR primer anneals under 
appropriate conditions. When the two DNA molecules are 
covalently linked by DNA ligase IV to form a single DNA 
molecule, a PGR template results which can be amplified 
using the primers. No PGR product results in the absence of 

10 ligation. The amount of PGR product obtained in a given 
reaction can be quantitated with respect to DNA ligase 
activity. Another option is to attach a DNA molecule to an 
insoluble support and to add another, labelled DNA molecule. 
Following addition of DNA ligase IV in the presence or 

15 absence of a test compound and a washing step, attachment of 
the second molecule to the support, which can only take 
place via ligation to the DNA molecule bound to the support, 
can be determined by means of the label and related to DNA 
ligase IV activity. A further assay may include DNA end- 

20 joining, e.g. as described by Gawunder et ai., 1997. 

A substance found to be able to modulate DNA ligase IV 
activity, e.g. in the presence or absence of XRGC4, may be 
employed in a similar assay using DNA ligase I and/or DNA 
ligase III, in order to assess specificity for DNA ligase 

25 IV. 

Performance of an assay method according to the present 
invention may be followed by isolation and/or manufacture 
and/or use of a compound, substance or molecule which tests 
30 positive for ability to modulate interaction between XRCC4 
and DNA ligase IV and/or inhibit XRCG4 or DNA ligase IV 
activity or a mediated activity. 

The precise format of an assay of the invention may be 
35 varied by those of skill in the art using routine skill and 
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knowledge. For example, interaction between substances may 
be studied in vitro by labelling one with a detectable label 
and bringing it into contact with the other which has been 
immobilised on a solid support. Suitable detectable labels, 
5 especially for petidyl substances include ^^S-methionine 
which may be incorporated into recombinantly produced 
peptides and polypeptides. Recombinantly produced peptides 
and polypeptides may also be expressed as a fusion protein 
containing an epitope which can be labelled with an 
10 antibody. 

The protein which is immobilized on a solid support may 
be immobilized using an antibody against that protein bound 
to a solid support or via other technologies which are known 
per se. A preferred in vitro interaction may utilise a 
15 fusion protein including glutathione-S-transf erase (GST) . 

This may be immobilized on glutathione agarose beads. In an 
in vitro assay format of the type described above a test 
compound can be assayed by determining its ability to 
diminish the amount of labelled peptide or polypeptide which 

0 binds to the immobilized GST-fusion polypeptide. This may be 
determined by fractionating the glutathione-agarose beads by 
SDS-polyacrylamide gel electrophoresis. Alternatively, the 
beads may be rinsed to remove unbound protein and the amount 
of protein which has bound can be determined by counting the 

5 amount of label present in, for example, a suitable 
scintillation counter. 

An assay according to the present invention may also 
take the form of an in vivo assay. The in vivo assay may be 
performed in a cell line such as a yeast strain or mammalian 

0 cell line in which the relevant polypeptides or peptides are 
expressed from one or more vectors introduced into the cell. 

The ability of a test compound to modulate interaction 
or binding between XRCC4 and DNA ligase IV may be determined 
using a so-called two-hybid assay. 

5 For example, a polypeptide or peptide containing a 
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fragment of XRCC4 or DNA ligase IV as the case may be, or a 
peptidyl analogue or variant thereof as disclosed, may be 
fused to a DNA binding domain such as that of the yeast 
transcription factor GAL 4 . The GAL 4 transcription factor 
5 includes two functional domains. These domains are the DNA 
binding domain (GAL4DBD) and the GAL4 transcriptional 
activation domain (GAL4TAD) . By fusing one polypeptide or 
peptide to one of those domains and another polypeptide or 
peptide to the respective counterpart, a functional GAL 4 

10 transcription factor is restored only when two polypeptides 
or peptides of interest interact. Thus, interaction of the 
polypeptides or peptides may be measured by the use of a 
reporter gene probably linked to a GAL 4 DNA binding site 
which is capable of activating transcription of said 

15 reporter gene. This assay format is described by Fields and 
Song, 1989, Nature 340 ; 245-246. This type of assay format 
can be used in both mammalian cells and in yeast. Other 
combinations of DNA binding domain and transcriptional 
activation domain are available in the art and may be 

20 preferred, such as the LexA DNA binding domain and the VP60 
transcriptional activation domain. 

To take a Lex/VP60 two hybrid screen by way of example 
for the purpose of illustration, yeast or mammalian cells 
may be transformed with a reporter gene construction which 

25 expresses a selective marker protein (e.g. encoding p- 

galactosidase or luciferase) . The promoter of that gene is 
designed such that it contains binding site for the LexA 
DNA-binding protein. Gene expression from that plasmid is 
usually very low. Two more expression vectors may be 

30 transformed into the yeast containing the selectable marker 
expression plasmid, one containing the coding sequence for 
the full length LexA gene linked to a multiple cloning site. 
This multiple cloning site is used to clone a gene of 
interest, i.e. encoding a XRCC4 or DNA ligase IV polypeptide 

35 or peptide in accordance with the present invention, in 
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frame on to the LexA coding region. The second expression 
vector then contains the activation domain of the herpes 
simplex transactivator VP16 fused to a test peptide sequence 
or more preferably a library of sequences encoding peptides 
5 with diverse e.g. random sequences. Those two plasmids 
facilitate expression from the reporter construct containing 
the selectable marker only when the LexA fusion construct 
interacts with a polypeptide or peptide sequence derived 
from the peptide library. 

10 A modification of this when looking for peptides or 

other substances which interfere with interaction between a 
XRCC4 polypeptide or peptide and DNA ligase IV polypeptide 
or peptide, employs the XRCC4 or DNA ligase IV polypeptide 
or peptide as a fusion with the LexA DNA binding domain, and 

15 the counterpart DNA ligase IV or XRCC4 polypeptide or 
peptide as a fusion with VP60, and involves a third 
expression cassette, which may be on a separate expression 
vector, from which a peptide or a library of peptides of 
diverse and/or random sequence may be expressed.' A 

20 reduction in reporter gene expression (e.g. in the case of 
p-galactosidase a weakening of the blue colour) results from 
the presence of a peptide which disrupts the XRCC4/DNA 
ligase IV interaction, which interaction is required for 
transcriptional activation of the (3-galactosidase gene. 

25 Where a test substance is not peptidyl and may not be 
expressed from encoding nucleic acid within a said third 
expression cassette, a similar system may be employed with 
the test substance supplied exogenously. 

As noted, instead of using LexA and VP60, other similar 

30 combinations of proteins which together form a functional 
transcriptional activator may be used, such as the GAL4 DNA 
binding domain and the GAL4 transcriptional activation 
domain . 

When performing a two hybrid assay to look for 
35 substances which interfere with the interaction between two 
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polypeptides or peptides it may be preferred to use 
mammalian cells instead of yeast cells. The same principles 
apply and appropriate methods are well known to those 
skilled in the art. 

5 

The amount of test substance or compound which may be 
added to an assay of the invention will normally be 
determined by trial and error depending upon the type of 
compound used. Typically, from about 0.01 nM to 100 pM or 

10 more concentrations of putative inhibitor compound may be 
used, for example from 0.1 to 50 pM, such as about 10 pM. 
Greater concentrations may be used .when a peptide is the 
test substance. Even a molecule with weak binding may be a 
useful lead compound for further investigation and 

15 development. 

Compounds which may be used may be natural or synthetic 
chemical compounds used in drug screening programmes. 
Extracts of plants which contain several characterised or 
uncharacterised components may also be used. 

20 

Antibodies directed to the site of interaction in 
either protein form a further class of putative inhibitor 
compounds. Candidate inhibitor antibodies may be 
characterised- and their binding regions determined to 

25 provide single chain antibodies and fragments thereof which 
are responsible for disrupting the interaction. 

Antibodies may be obtained using techniques which are 
standard in the art. Methods of producing antibodies 
include immunising a mammal (e.g. mouse, rat, rabbit, horse, 

30 goat, sheep or monkey) with the protein or a fragment 

thereof. Antibodies may be obtained from immunised animals 
using any of a variety of techniques known in the art, and 
screened, preferably using binding of antibody to antigen of 
interest. For instance. Western blotting techniques or 

35 immunoprecipitation may be used (Armitage et ai., 1992, 
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Nature 357: 80-82). Isolation of antibodies and/or 
antibody-producing cells from an animal may be accompanied 
by a step of sacrificing the animal. 

As an alternative or supplement to immunising a mammal 
5 with a peptide, an antibody specific for a protein may be 
obtained from a recombinantly produced library of expressed 
immunoglobulin variable domains, e.g. using lambda 
bacteriophage or filamentous bacteriophage which display 
functional immunoglobulin binding domains on their surfaces; 
10 for instance see WO92/01047. The library may be naive, that 
is constructed from sequences obtained from an organism 
which has not been immunised with any of the proteins (or 
fragments) , or may be one constructed using sequences 
obtained from an organism which has been exposed to the 
15 antigen of interest. 

Antibodies according to the present invention may be 
modified in a number of ways. Indeed the term "antibody" 
should be construed as covering any binding substance having 
a binding domain with the required specificity. Thus the 
20 invention covers antibody fragments, derivatives, functional 
equivalents and homologues of antibodies, including 
synthetic molecules and molecules whose shape mimicks that 
of an antibody enabling it to bind an antigen or epitope. 
Example antibody fragments, capable of binding an 
25 antigen or other binding partner are the Fab fragment 
consisting of the VL, VH, CI and CHI domains; the Fd 
fragment consisting of the VH and CHI domains; the Fv 
fragment consisting of the VL and VH domains of a single arm 
of an antibody; the dAb fragment which consists of a VH 
30 domain; isolated CDR regions and F(ab')2 fragments, a 

bivalent fragment including two Fab fragments linked by a 
disulphide bridge at the hinge region. Single chain Fv 
fragments are also included. 

A hybridoma producing a monoclonal antibody according 
35 to the present invention may be subject to genetic mutation 
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or other changes. It will further be understood by those 
skilled in the art that a monoclonal antibody can be 
subjected to the techniques of recombinant DNA technology to 
produce other antibodies or chimeric molecules which retain 
5 the specificity of the original antibody. Such techniques 
may involve introducing DNA encoding the immunoglobulin 
variable region, or the complementarity determining regions 
(CDRs) , of an antibody to the constant regions, or constant 
regions plus framework regions, of a different 
10 immunoglobulin. See, for instance, EP184187A, GB 2188638A 
or EP-A-0239400. Cloning and expression of chimeric 
antibodies are described in EP-A-0I20694 and EP-A-0125023 , 
Hybridomas capable of producing antibody with desired 
binding characteristics are within the scope of the present 
15 invention, as are host cells, eukaryotic or prokaryotic, 
containing nucleic acid encoding antibodies (including 
antibody fragments) and capable of their expression. The 
invention also provides methods of production of the 
antibodies including growing a cell capable of producing the 
20 antibody under conditions in which the antibody is produced, 
and preferably secreted. 

The reactivities of antibodies on a sample may be 
determined by any appropriate means. Tagging with 
individual reporter molecules is one possibility. The 
25 reporter molecules may directly or indirectly generate 

detectable, and preferably measurable, signals. The linkage 
of reporter molecules may be directly or indirectly, 
covaiently, e.g. via a peptide bond or non-covalently . 
Linkage via a peptide bond may be as a result of recombinant 
30 expression of a gene fusion encoding antibody and reporter 
molecule . 

One favoured mode is by covalent linkage of each 
antibody with an individual f luorochrome , phosphor or laser 
dye with spectrally isolated absorption or emission 
35 characteristics. Suitable f luorochromes include 
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fluorescein, rhodamine, phycoerythrin and Texas Red. 
Suitable chromogenic dyes include diaminobenzidine . 

Other reporters include macromolecular colloidal 
particles or particulate material such as latex beads that 
5 are coloured, magnetic or paramagnetic, and biologically or 
chemically active agents that can directly or indirectly 
cause detectable signals to be visually observed, 
electronically detected or otherwise recorded. These 
molecules may be enzymes which catalyse reactions that 
10 develop or change colours or cause changes in electrical 

properties, for example. They may be molecularly excitable, 
such that electronic transitions between energy states 
result in characteristic spectral absorptions or emissions. 
They may include chemical entities used in conjunction with 
15 biosensors. Biotin/avidin or biotin/streptavidin and 
alkaline phosphatase detection systems may be employed. 

The mode of determining binding is not a feature of the 
present invention and those skilled in the art are able to 
choose a suitable mode according to their preference and 
20 general knowledge. 

Antibodies may also be used in purifying and/or 
isolating a polypeptide or peptide according to the present 
invention, for instance following production of the 
polypeptide or peptide by expression from encoding nucleic 
25 acid therefor. Antibodies may be useful in a therapeutic 

context (which may include prophylaxis) to disrupt XRCC4/DNA 
ligase IV interaction with a view to inhibiting their 
activity. Antibodies can for instance be micro-injected 
into cells, e.g. at a tumour site. Antibodies may be 
30 employed in accordance with the present invention for other 
therapeutic and non-therapeutic purposes which are discussed 
elsewhere herein. 

Other candidate inhibitor compounds may be based on 
35 modelling the 3-dimensional structure of a polypeptide or 
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peptide fragment and using rational drug design to provide 
potential inhibitor compounds with particular molecular 
shape, size and charge characteristics. 

5 A compound found to have the ability to affect XRCC4 

and/or DNA ligase IV activity has therapeutic and other 
potential in a number of contexts, as discussed. For 
therapeutic treatment such a compound may be used in 
combination with any other active substance, e.g. for anti- 

10 tumour therapy another anti-tumour compound or therapy, such 
as radiotherapy or chemotherapy. In such a case, the assay 
of the invention, when conducted in vivo, need not measure 
the degree of inhibition of binding or of modulation of DNA 
ligase IV activity caused by the compound being tested. 

15 Instead the effect on DNA repair, homologous recombination, 
cell viability, cell killing (e.g. in the presence and 
absence of radio- and/or chemo-therapy ) , retroviral 
integration, and so on, may be measured. It may be that 
such a modified assay is run in parallel with or subsequent 

20 to the main assay of the invention in order to confirm that 
any such effect is as a result of the inhibition of binding 
or interaction between XRCC4 and DNA ligase IV caused by 
said inhibitor compound and not merely a general toxic 
effect . 

25 Remember that the inventors have found interaction 

between XRCC4 and DNA-Pkcs/Ku and affecting this interaction 
is a part of the various aspects of the present invention in 
analogous fashion to affecting the interaction between XRCC4 
and DNA ligase TV. The present inventors' finding in this 

30 respect is confirmed by Leber et ai., ^^The XRCC4 gene 

product is a target for and interacts with the DNA-dependent 
protein kinase" J. Biol. Chem (1998) Jan. 16 273 (3), 1794 - 
according to information available from the World Wide Web 
on 12 January 1998. 
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An agent identified using one or more primary screens 
(e.g. in a cell-free system) as having ability to bind XRCC4 
and/or DNA ligase IV and/or modulate activity of XRCC4 
and/or DNA ligase IV may be assessed further using one or 
5 more secondary screens. A secondary screen may involve 
testing for cellular radiosensitisation and/or sensitisation 
to radiomimetic drugs, testing for impairment of V{D)J 
recombination in a transfection assay, and/or testing for 
ability to potentiate homologous recombination-mediated gene 
10 targeting. This may be tested directly and/or using a 
transfection assay that gives a read-out only after 
homologous recombination has occur r.ed, (e.g. involving co- 
transformation or co-transf ection of a cellular system with 
two plasmids that must undergo homologous recombination to 
15 yield an active reporter gene (such as luciferase or green 
fluorescent protein) , or homologous integration of 
tranfected DNA into the genome. 

Following identification of a substance or agent which 
20 modulates or affects XRCC4 and/or DNA ligase IV activity, 
the substance or agent may be investigated further. 
Furthermore, it may be manufactured and/or used in 
preparation, i.e. manufacture or formulation, of a 
composition such as a medicament, pharmaceutical composition 
25 or drug. These may be administered to individuals, e.g. for 
any of the purposes discussed elsewhere herein. 

As noted, the agent may be peptidyl, e.g. a peptide 
which includes a sequence as recited above, or may be a 
30 functional analogue of such a peptide. 

As used herein, the expression "functional analogue" 
relates to peptide variants or organic compounds having the 
same functional activity as the peptide in question, which 
may interfere with the binding between XRCC4 and DNA ligase 
35 IV. Examples of such analogues include chemical compounds 
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which are modelled to resemble the three dimensional 
structure of the XRCC4 or DNA ligase IV domain in the 
contact area, and in particular the arrangement of the key 
amino acid residues as they appear in XRCC4 or DNA ligase 
5 IV. 

In a further aspect, the present invention provides the 
use of the above substances in methods of designing or 
screening for mimetics of the substances. 

Accordingly, the present invention provides a method of 

10 designing mimetics of XRCC4 or DNA ligase IV having the 
biological activity of DNA ligase IV or XRCC4 binding or 
inhibition, the activity of allosteric inhibition of DNA 
ligase IV or XRCC4 and/or the activity of modulating, e.g. 
inhibiting, XRCC4/DNA ligase IV interaction, said method 

15 comprising: 

(i) analysing a substance having the biological 
activity to determine the amino acid residues essential and 
important for the activity to define a pharmacophore; and, 

(ii) modelling the pharmacophore to design and/or 

20 screen candidate mimetics having the biological activity. 

Suitable modelling techniques are known in the art. 
This includes the design of so-called "mimetics" which 
involves the study of the functional interactions 
fluorogenic oligonucleotide the molecules and the design of 

25 compounds which contain functional groups arranged in such a 
manner that they could reproduced those interactions. 

The designing of mimetics to a known pharmaceutically 
active compound is a known approach to the development of 

30 pharmaceuticals based on a "lead" compound. This might be 
desirable where the active compound is difficult or 
expensive to synthesise or where it is unsuitable for a 
particular method of administration, e.g. peptides are not 
well suited as active agents for oral compositions as they 

35 tend to be quickly degraded by proteases in the alimentary 
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canal. Mimetic design, synthesis and testing may be used to 
avoid randomly screening large number of molecules for a 
target property. 

There are several steps commonly taken in the design of 
5 a mimetic from a compound having a given target property. 
Firstly, the particular parts of the compound that are 
critical and/or important in determining the target property 
are determined. In the case of a peptide, this can be done 
by systematically varying the amino acid residues in the 
10 peptide, e.g. by substituting each residue in turn. These 
parts or residues constituting the active region of the 
compound are known as its "pharmacophore". 

Once the pharmacophore has been found, its structure is 
modelled to according its physical properties, e.g. 
15 stereochemistry, bonding, size and/or charge, using data 
from a range of sources, e.g. spectroscopic techniques, X- 
ray diffraction data and NMR. Computational analysis, 
similarity mapping (which models the charge and/or volume of 
a pharmacophore, rather than the bonding between atoms) and 
20 other techniques can be used in this modelling process. 

In a variant of this approach, the three-dimensional 
structure of the ligand and its binding partner are 
modelled. This can be especially useful where the ligand 
and/or binding partner change conformation on binding, 
25 allowing the model to take account of this the design of the 
mimetic . 

A template molecule is then selected onto which 
chemical groups which mimic the pharmacophore can be 
grafted. The template molecule and the chemical groups 

30 grafted on to it can conveniently be selected so that the 
mimetic is easy to synthesise, is likely to be 
pharmacologically acceptable, and does not degrade in vivo, 
while retaining the biological activity of the lead 
compound. The mimetic or mimetics found by this approach 

35 can then be screened to see whether they have the target 
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property, or to what extent they exhibit it. Further 
optimisation or modification can then be carried out to 
arrive at one or more final mimetics for in vivo or clinical 
testing . 

5 The mimetic or mimetics found by this approach can then 

be screened to see whether they have the target property, or 
to what extent they exhibit it. Further optimisation or 
modification can then be carried out to arrive at one or 
more final mimetics for In vivo or clinical testing. 
10 Mimetics of this type together with their use in 

therapy form a further aspect of the invention. 

The present invention further provides the use of a 
peptide which includes a sequence as disclosed, or a 

15 derivative, active portion, analogue, variant or mimetic, 
thereof able to bind XRCC4 or DNA ligase IV and/or modulate, 
e.g. inhibit, interaction between XRCC4 and DNA ligase IV 
and/or modulate, e.g inhibit, XRCC4 and/or DNA ligase IV 
activity, in screening for a substance able to bind DNA 

20 ligase IV and/or XRCC4, and/or modulate, e.g. inhibit, 

interaction between XRCC4 and DNA ligase IV, and/or inhibit 
XRCC4 and/or DNA ligase IV activity. 

Generally, such a substance, e.g. inhibitor, according 
to the present invention is provided in an isolated and/or 

25 purified form, i.e. substantially pure. This may include 
being in a composition where it represents at least about 
90% active ingredient, more preferably at least about 95%, 
more preferably at least about 98%. Such a composition may, 
however, include inert carrier materials or other 

30 pharmaceutically and physiologicaly acceptable excipients. 
As noted below, a composition according to the present 
invention may include in addition to an inhibitor compound 
as disclosed, one or more other molecules of therapeutic 
use, such as an anti-tumour agent. 

35 
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The present invention extends in various aspects not 
only to a substance identified as a modulator of XRCC4 and 
DNA ligase IV interaction and/or XRCC4 or DNA ligase IV- 
mediated activity, property or pathway, in accordance with 
5 what is disclosed herein, but also a pharmaceutical 
composition, medicament, drug or other composition 
comprising such a substance, a method comprising 
administration of such a composition to a patient, e.g. for 
a purpose discussed elsewhere herein, which may include 
10 preventative treatment, use of such a substance in 

manufacture of a composition for administration, e.g. for a 
purpose discussed elsewhere herein,- and a method of making a 
pharmaceutical composition comprising admixing such a 
substance with a pharmaceutically acceptable excipient, 
15 vehicle or carrier, and optionally other ingredients. 

A substance according to the present invention such as 
an inhibitor of XRCC4 and DNA ligase IV interaction or 
binding may be provided for use in a method of treatment of 
the human or animal body by therapy which affects DNA repair 
20 or other XRCC4 or DNA ligase IV-mediated activity in cells, 
e.g. tumour cells. Other purposes of a method of treatment 
employing a substance in accordance with the present 
invention are dicussed elsewhere herein. 

Thus the invention further provides a method of 
25 modulating DNA repair activity, particularly DSB end- 
joining, or other XRCC4 and/or DNA ligase IV-mediated 
activity, e.g. for a purpose discussed elsewhere herein, 
which includes administering an agent which modulates, 
inhibits or blocks the binding of XRCC4 to DNA ligase IV 
30 protein, such a method being useful in treatment where such 
modulation, inhibition or blocking is desirable. 

The invention further provides a method of treatment 
which includes administering to a patient an agent which 
interferes with the binding of XRCC4 to DNA ligase IV. 
35 Exemplary purposes of such treatment are discussed elsewhere 
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Whether it is a polypeptide, antibody, peptide, nucleic 
acid molecule, small molecule, mimetic or other 
5 pharmaceutically useful compound according to the present 
invention that is to be given to an individual, 
administration is preferably in a "prophylactically 
effective amount" or a "therapeutically effective amount" 
(as the case may be, although prophylaxis may be considered 
10 therapy) , this being sufficient to show benefit to the 

individual. The actual amount administered, and rate and 
time-course of administration, will, depend on the nature and 
severity of what is being treated. Prescription of 
treatment, e.g. decisions on dosage etc, is within the 
15 responsibility of general practioners and other medical 
doctors . 

A composition may be administered alone or in 
combination with other treatments, either simultaneously or 
sequentially dependent upon the condition to be treated. 

20 Pharmaceutical compositions according to the present 

invention, and for use in accordance with the present 
invention, may include, in addition to active ingredient, a 
pharmaceutically acceptable excipient, carrier, buffer, 
stabiliser or other materials well known to those skilled in 

25 the art. Such materials should be non-toxic and should not 
interfere with the efficacy of the active ingredient. The 
precise nature of the carrier or other material will depend 
on the route of administration, which may be oral, or by 
injection, e.g. cutaneous, subcutaneous or intravenous. 

30 Pharmaceutical compositions for oral administration may 

be in tablet, capsule, powder or liquid form, A tablet may 
include a solid carrier such as gelatin or an adjuvant. 
Liquid pharmaceutical compositions generally include a 
liquid carrier such as water, petroleum, animal or vegetable 

35 oils, mineral oil or synthetic oil. Physiological saline 
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solution, dextrose or other saccharide solution or glycols 
such as ethylene glycol, propylene glycol or polyethylene 
glycol may be included. 

For intravenous, cutaneous or subcutaneous injection, 
5 or injection at the site of affliction, the active 
ingredient will be in the form of a parenterally acceptable 
aqueous solution which is pyrogen-free and has suitable pH, 
isotonicity and stability. Those of relevant skill in the 
art are well able to prepare suitable solutions using, for 
10 example, isotonic vehicles such as Sodium Chloride 

Injection, Ringer's Injection, Lactated Ringer's Injection. 
Preservatives, stabilisers, buffers, antioxidants and/or 
other additives may be included, as required. 

Liposomes, particularly cationic liposomes, may be used 
15 in carrier formulations. 

Examples of techniques and protocols mentioned above 
can be found in Remington's Pharmaceutical Sciences, 16th 
edition, Osol, A. (ed) , 1980. 

The agent may be administered in a localised manner to 
20 a tumour site or other desired site or may be delivered in a 
manner in which it targets tumour or other cells. 

Targeting therapies may be used to deliver the active 
agent more specifically to certain types of cell, by the use 
of targeting systems such as antibody or cell specific 
25 ligands. Targeting may be desirable for a variety of 

reasons, for example if the agent is unacceptably toxic, or 
if it would otherwise require too high a dosage, or if it 
would not otherwise be able to enter the target cells. 

Instead of administering these agents directly, they 
30 may be produced in the target cells by expression from an 
encoding gene introduced into the cells, eg in a viral 
vector (a variant of the VDEPT technique - see below) . The 
vector may targeted to the specific cells to be treated, or 
it may contain regulatory elements which are switched on 
35 more or less selectively by the target cells. 
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The agent (e.g. small molecule, mimetic) may be 
administered in a precursor form, for conversion to the 
active form by an activating agent produced in, or targeted 
to, the cells to be treated. This type of approach is 
5 sometimes known as ADEPT or VDEPT, the former involving 
targeting the activator to the cells by conjugation to a 
cell-specific antibody, while the latter involves producing 
the activator, e.g. an enzyme, in a vector by expression 
from encoding DNA in a viral vector (see for example, EP-A- 
10 415731 and WO 90/07936). 

An agent may be administered in a form which is 
inactive but which is converted to .an active form in the 
body. For instance, the agent may be phosphorylated (e.g. 
to improve solubility) with the phosphate being cleaved to 
15 provide an active form of the agent in the body. 

A composition may be administered alone or in 
combination with other treatments, either simultaneously or 
sequentially dependent upon the condition to be treated, 
such as cancer, virus infection or any other condition in 
20 which a XRCC4 or DNA ligase IV-mediated effect is desirable. 
Nucleic acid according to the present invention, 
encoding a polypeptide or peptide able to modulate, e.g. 
interfere with, XRCC4 and DNA ligase IV interaction or 
binding and/or induce or modulate activity or other XRCC4 or 
25 DNA ligase IV-mediated cellular pathway or function, may be 
used in methods of gene therapy, for instance in treatment 
of individuals, e.g. with the aim of preventing or curing 
(wholly or partially) a disorder or for another purpose as 
discussed elsewhere herein. 
30 Vectors such as viral vectors have been used in the 

prior art to introduce nucleic acid into a wide variety of 
different target cells. Typically the vectors are exposed 
to the target cells so that transfection can take place in a 
sufficient proportion of the cells to provide a useful 
35 therapeutic or prophylactic effect from the expression of 
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the desired polypeptide. The transfected nucleic acid may 
be permanently incorporated into the genome of each of the 
targeted cells, providing long lasting effect, or 
alternatively the treatment may have to be repeated 
5 periodically, 

A variety of vectors, both viral vectors and plasmid 
vectors, are known in the art, see US Patent No. 5,252,479 
and WO 93/07282. In particular, a number of viruses have 
been used as gene transfer vectors, including papovaviruses, 

10 such as SV40, vaccinia virus, herpesviruses, including HSV 
and EBV, and retroviruses. Many gene therapy protocols in 
the prior art have used disabled murine retroviruses. 

As an alternative to the use of viral vectors other 
known methods of introducing nucleic acid into cells 

15 includes electroporation, calcium phosphate co- 
precipitation, mechanical techniques such as microinjection, 
transfer mediated by liposomes and direct DNA uptake and 
receptor-mediated DNA transfer. 

Receptor-mediated gene transfer, in which the nucleic 

20 acid is linked to a protein ligand via polylysine, with the 
ligand being specific for a receptor present on the surface 
of the target cells, is an example of a technique for 
specifically targeting nucleic acid to particular cells. 

25 A polypeptide, peptide or other substance able to 

mediate or interfere with the interaction of the relevant 
polypeptide, peptide or other substance as disclosed herein, 
or a nucleic acid molecule encoding a peptidyl such 
molecule, may be provided in a kit, e.g. sealed in a 

30 suitable container which protects its contents from the 

external environment. Such a kit may include instructions 
for use. 

Further aspects of the present invention arise from the 
35 fact that the work described herein provides indication that 
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mammals including humans deficient in XRCC4 arid/or DNA 
ligase IV will have immune deficiencies, heightened cancer 
predisposition, particularly lyphoreticular malignancies, 
and/or will be radiosensitive. 
5 For example, a small but significant percentage of 

human patients have disastrously debilitating (sometimes 
fatal) reactions to standard clinical doses of radiation. 
This is unfortunate, particularly where alternative modes of 
(e.g.) cancer treatment are available. The present 

10 invention allows and provides for diagnosis of such 
radiosensitive patients. 

Diagnosis of XRCC4 and/or DNA .ligase IV deficiency, 
which may be reduced ability of the particular polypeptide 
of an individual to interact with the other, or another 

15 component of a DNA repair pathway, may be used in 

conjunction with similar analysis of activity, function or 
structural integrity of other components of DNA repair 
pathways, such as Ku70, Ku80, DNA-PKcs, etc. 

20 A number of methods are known in the art for analysing 

biological samples from individuals to determine whether the 
individual carries an allele of a particular gene 
predisposing them to a particular disorder. The purpose of 
such analysis- may be used for diagnosis or prognosis, and 

25 serve to detect the presence of an existing defect (e.g. 

radiosensitivity) , to help identify the type of defect (e.g. 
a factor in a manifest clinical disorder, such as cancer) , 
to assist a physician in determining the severity or likely 
course of a disorder and/or to optimise treatment of it. 

30 Alternatively, the methods can be used to detect alleles 

that are statistically associated with a susceptibility to a 
disorder in the future, e.g. cancer, identifying individuals 
who . would benefit from regular screening to provide early 
diagnosis of the disorder, e.g. cancer. 
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For instance, oligonucleotides designed to hybridise to 
a region within the gene of interest may be used in 
diagnostic and prognostic screening. 

Oligonucleotide probes or primers, as well as the full- 
5 length gene seguence (and mutants, alleles, variants and 
derivatives) are useful in screening a test sample 
containing nucleic acid for the presence of alleles, mutants 
and variants, especially those that confer susceptibility or 
predisposition to a particular disorder, including 

10 radiosensitivity and cancers, the probes hybridising with a 
target sequence from a sample obtained from the individual 
being tested. The conditions of the hybridisation can be 
controlled to minimise non-specific binding, and preferably 
stringent to moderately stringent hybridisation conditions 

15 are preferred. The skilled person is readily able to design 
such probes, label them and devise suitable conditions for 
the hybridisation reactions, assisted by textbooks such as 
Sambrook et al (1989) and Ausubel et al (1992) . 

Nucleic acid isolated and/or purified from one or more 

20 cells (e.g. human) or a nucleic acid library derived from 
nucleic acid isolated and/or purified from cells (e.g. a 
cDNA library derived from mRNA isolated from the cells) , may 
be probed under conditions for selective hybridisation 
and/or subjected to a specific nucleic acid amplification 

25 reaction such as the polymerase chain reaction (PGR) . 

A method may include hybridisation of one or more (e.g. 
two) probes or primers to target nucleic acid. Where the 
nucleic acid is double-stranded DNA, hybridisation will 
generally be preceded by denaturation to produce single- 

30 stranded DNA. The hybridisation may be as part of a PGR 
procedure, or as part of a probing procedure not involving 
PGR. An example procedure would be a combination of PGR and 
low stringency hybridisation. A screening procedure, chosen 
from the many available to those skilled in the art, is used 

35 to identify successful hybridisation events and isolated 
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hybridised nucleic acid. 

Binding of a probe to target nucleic acid (e.g. DNA) 
may be measured using any of a variety of techniques at the 
disposal of those skilled in the art. For instance, probes 
5 may be radioactively, f luorescently or enzymatically 
labelled. Other methods not employing labelling of probe 
include examination of restriction fragment length 
polymorphisms, amplification using PGR, RNAase cleavage and 
allele specific oligonucleotide probing. 

10 Probing may employ the standard Southern blotting 

technique. For instance DNA may be extracted from cells and 
digested with different restriction enzymes. Restriction 
fragments may then be separated by electrophoresis on an 
agarose gel, before denaturation and transfer to a 

15 nitrocellulose filter. Labelled probe may be hybridised to 
the DNA fragments on the filter and binding determined. DNA 
for probing may be prepared from RNA preparations from 
cells . 

Those skilled in the art are well able to employ 
20 suitable conditions of the desired stringency for selective 
hybridisation, taking into account factors such as 
oligonucleotide length and base composition, temperature and 
so on . 

PGR techniques for the amplification of nucleic acid 
25 are described in US Patent No. 4,683,195. In general, such 
techniques require that sequence information from the ends 
of the target sequence is known to allow suitable forward 
and reverse oligonucleotide primers to be designed to be 
identical or similar to the polynucleotide sequence that is 
30 the target for the amplification. PGR comprises steps of 
denaturation of template nucleic acid (if double-stranded) , 
annealing of primer to target, and polymerisation. The 
nucleic acid probed or used as template in the amplification 
reaction may be genomic DNA, cDNA or RNA. PGR can be used to 
35 amplify specific sequences from genomic DNA, specific RNA 
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sequences and cDNA transcribed from mRNA, bacteriophage or 
plasmid sequences. References for the general use of PGR 
techniques include Mullis et al. Cold Spring Harbor Symp. 
Quant- Biol., 51:263, (1987), Ehriich (ed) , PGR technology, 
5 Stockton Press, NY, 1989, Ehriich et al. Science, 252:1643- 
1650, (1991), "PGR protocols; A Guide to Methods and 
Applications", Eds. Innis et al. Academic Press, New York, 
(1990) . 

On the basis of amino acid sequence information, 
10 oligonucleotide probes or primers may be designed, taking 
into account the degeneracy of the genetic code, and, where 
appropriate, codon usage of the organism from the candidate 
nucleic acid is derived. An oligonucleotide for use in 
nucleic acid amplification may have about 10 or fewer codons 
15 (e.g. 6, 7 or 8), i.e. be about 30 or fewer nucleotides in 
length (e.g. 18, 21 or 24). Generally specific primers are 
upwards of 14 nucleotides in length, but not more than 18- 
20. Those skilled in the art are well versed in the design 
of primers for use processes such as PGR. 

20 

A further aspect of the present invention provides an 
oligonucleotide or polynucleotide fragment of XRGC4 or DNA 
ligase IV corresponding to part of the gene coding sequence 
or a complementary sequence, in particular for use in a 

25 method of obtaining and/or screening nucleic acid. The 
sequences referred to above may be modified by addition, 
substitution, insertion or deletion of one or more 
nucleotides, but preferably without abolition of ability to 
hybridise selectively with the relevant gene sequence, that 

30 is wherein the degree of homology of the oligonucleotide or 
polynucleotide with the sequence given is sufficiently high. 

In some preferred embodiments, oligonucleotides 
according to the present invention that are fragments of the 
relevant gene sequence, in wild-type form or in the form of 

35 any allele associated with susceptibility to cancer or other 
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disorder, are at least about 10 nucleotides in length, more 
preferably at least about 15 nucleotides in length, more 
preferably at least about 20 nucleotides in length. Such 
fragments themselves individually represent aspects of the 
5 present invention. Fragments and other oligonucleotides may 
be used as primers or probes as discussed but may also be 
generated (e.g. by PGR) in methods concerned with 
determining the presence in a test sample of a sequence 
indicative of susceptibility to cancer or other disorder. 
10 Preferred probes or primers according to certain 

embodiments of this aspect of the present invention are 
designed to hybridise with and/or amplify a fragment of the 
relevant sequence (e.g. XRCC4 or DNA ligase IV) including 
any residue mutation at which is associated with cancer 
15 susceptibility. 

A number of methods are known in the art for analysing 
biological samples from individuals to determine whether the 
individual carries a gene allele with a mutation 

20 predisposing them to disease. The purpose of such analysis 
may be used for diagnosis or prognosis, and serve to detect 
the presence of, e.g., an existing cancer, to help identify 
the type of cancer, to assist a physician in determining the 
severity or likely course of the cancer and/or to optimise 

25 treatment of it. The methods may be used to detect alleles 
that are statistically associated with a susceptibility to 
cancer or other disorder in the future, e.g. early onset 
cancer, identifying individuals who would benefit from 
regular screening to provide early diagnosis of the 

30 disorder. 

Broadly, the methods divide into those screening for 
the presence of nucleic acid sequences and those that rely 
on detecting the presence or absence of polypeptide. The 
methods make use of biological samples from individuals that 
35 are suspected of contain the nucleic acid sequences or 



wo 98/30902 PCT/GB98/00095 

51 

polypeptide. Examples of biological samples include blood, 
plasma, serum, tissue samples, tumour samples, saliva and 
urine . 



5 Exemplary approaches for detecting nucleic acid or 

polypeptides include : 

(a) comparing the sequence of nucleic acid in the 
sample with a XRCC4 and/or DNA ligase IV nucleic acid 
sequence to determine whether the sample from the patient 

10 contains one or more mutations, e.g. in a particular region, 
such as a region which interacts with counterpart DNA ligase 
IV or XRCC4, as the case may be, or. other component of a DNA 
repair pathway, or, particularly in the case of DNA ligase 
IV, a catalytic region, or, 

15 (b) determining the presence in a sample of a XRCC4 

and/or DNA ligase polypeptide encoded by and, if present, 
determining whether the polypeptide includes a region 
corresponding to wild-type, and/or is mutated in such a 
region; or, 

20 (c) using DNA fingerprinting to compare the restriction 

pattern produced when a restriction enzyme cuts a sample of 
nucleic acid from the patient with the restriction pattern 
obtained from a particular region corresponding to that for 
the normal gene or from known mutations thereof; or, 

25 (d) using a specific binding member capable of binding 

to a nucleic acid sequence (either a normal sequence or a 
known mutated sequence) encoding a particular polypeptide 
fragment, the specific binding member comprising nucleic 
acid hybridisable with the relevant sequence, or substances 

30 comprising an antibody domain with specificity for a native 
or mutated polypeptide fragment nucleic acid sequence or the 
polypeptide encoded by it, the specific binding member being 
labelled so that binding of the specific binding member to 
its binding partner is detectable; or, 

35 (e) using PGR involving one or more primers based on 
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the relevant normal or mutated gene sequence to screen for 
normal or mutant sequences within a particular region of the 
gene in a sample. 

A ''^specific binding pair" in such a context may 
5 comprise a specific binding member (sbm) and a binding 
partner (bp) which have a particular specificity for each 
other and which in normal conditions bind to each other in 
preference to other molecules. Examples of specific binding 
pairs are antigens and antibodies (see above), molecules and 

10 receptors and complementary nucleotide sequences. The 

skilled person will be able to think of many other examples 
and they do not need to be listed here. Further, the term 
"specific binding pair" is also applicable where either or 
both of the specific binding member and the binding partner 

15 comprise a part of a larger molecule. In embodiments in 
which the specific binding pair are nucleic acid sequences, 
they will be of a length to hybridise to each other under 
the conditions of the assay, preferably greater than 10 
nucleotides long, more preferably greater than 15 or 20 

20 nucleotides long. 

In most embodiments for screening for susceptibility 
alleles, the relevant nucleic acid (e.g. encoding XRCC4 
and/or DNA ligase IV) in the sample will initially be 
amplified, e.g. using PGR, to increase the amount of the 

25 analyte as compared to other sequences present in the 

sample. This allows the target sequences to be detected 
with a high degree of sensitivity if they are present in the 
sample. This initial step may be avoided by using highly 
sensitive array techniques that are becoming increasingly 

30 important in the art. 

To reiterate in further detail, the identification of 
biochemical activity and physiological function of XRCC4 and 
DNA ligase IV and particular regions thereof paves the way 
35 for aspects of the present invention to provide the use of 
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materials and methods, such as are disclosed and discussed 
above, for establishing the presence or absence in a test 
sample of an variant form of the gene, in particular an 
allele or variant specifically associated with cancer or 
5 other disorder such as radiosensitivity , as discussed. This 
may be for diagnosing a predisposition of an individual to a 
disorder. It may be for diagnosing a patient with a 
disorder as being associated with the gene. 

This allows for planning of appropriate therapeutic 

10 and/or prophylactic treatment, permitting stream-lining of 
treatment by targeting those most likely to benefit. 

A variant form of the gene may contain one or more 
insertions, deletions, substitutions and/or additions of one 
or more nucleotides compared with the wild-type sequence 

15 which may or may not disrupt the transcriptional activation 
function of the region examined herein. Differences at the 
nucleic acid level are not necessarily reflected by a 
difference in the amino acid sequence of the encoded 
polypeptide. However, a mutation or other difference in a 

20 gene may result in a frame-shift or stop codon, which could 
seriously affect the nature of the polypeptide produced, or 
a point mutation or gross mutational change to the encoded 
polypeptide, including insertion, deletion, substitution 
and/or addition of one or more amino acids or regions in the 

25 polypeptide, which may affect transcriptional activation. 

There are various methods for determining the presence 
or absence in a test sample of a particular nucleic acid 
sequence, such as a sequence for XRCC4 or DNA ligase IV, or 
30 a fragment, mutant, variant or allele thereof. 

Tests may be carried out on preparations containing 
genomic DNA, cDNA and/ or mRNA. Testing cDNA or mRNA has the 
advantage of the complexity of the nucleic acid being 
reduced by the absence of intron sequences, but the possible 
35 disadvantage of extra time and effort being required in 
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making the preparations. RNA is more difficult to 
manipulate than DNA because of the wide-spread occurrence of 
RN' ases . 

Nucleic acid in a test sample may be sequenced and the 
5 sequence compared with the relevant wild-tpye sequence to 
determine whether or not a difference is present. If so, 
the difference can be compared with known susceptibility 
alleles, to determine whether the test nucleic acid contains 
one or more of the variations indicated, or the difference 
10 can be investigated for association with the disorder of 
interest . 

Since it will not generally be time- or labour- 
efficient to sequence all nucleic acid in a test sample or 
even the whole gene for XRCC4 or DNA ligase IV, a specific 

15 amplification reaction such as PGR using one or more pairs 
of primers may be employed to amplify the region of interest 
in the nucleic aci . The amplified nucleic acid may then be 
sequenced as above, and/or tested in any other way to 
determine the presence or absence of a particular feature. 

20 Nucleic acid for testing may be prepared from nucleic acid 
removed from cells or in a library using a variety of other 
techniques such as restriction enzyme digest and 
electrophoresis . 

Nucleic acid may be screened using a variant- or 

25 allele-specif ic probe. Such a probe corresponds in sequence 
to a region of the relevant gene, or its complement, 
containing a sequence alteration known to be associated with 
susceptibility to cancer or other disorder of interest. 
Under suitably stringent conditions, specific hybridisation 

30 of such a probe to test nucleic acid is indicative of the 
presence of the sequence alteration in the test nucleic 
acid. For efficient screening purposes, more than one probe 
may. be used on the same test sample. 

Allele- or variant-specific oligonucleotides may 

35 similarly be used in PGR to specifically amplify particular 
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sequences if present in a test sample. Assessment of whether 
a PCR band contains a gene variant may be carried out in a 
number of ways familiar to those skilled in the art. The 
PCR product may for instance be treated in a way that 
5 enables one to display the mutation or polymorphism on a 
denaturing polyacrylamide DNA sequencing gel, with specific 
bands that are linked to the gene variants being selected. 

SSCP heteroduplex analysis may be used for screening 
DNA fragments for sequence variants/mutations. It generally 

10 involves amplifying radiolabelled 100-300 bp fragments of 
the gene, diluting these products and denaturing at 95°C. 
The fragments are quick-cooled on ice so that the DNA 
remains in single stranded form. These -'=:ingle stranded 
fragments are run through acrylamide based gels. 

15 Differences in the sequence composition will cause the 

single stranded molecules to adopt difference conformations 
in this gel matrix making their mobility different from wild 
type fragments, thus allowing detecting of mutations in the 
fragments being analysed relative to a control fragment upon 

20 exposure of the gel to X-ray film. 

Fragments with altered mobility/conformations may be 
directly excised from the gel and directly sequenced for 
mutation . 

Sequencing of a PCR product may involve precipitation 
25 with isopropanol, resuspension and sequencing using a TaqFS+ 
Dye terminator sequencing kit. Extension products may be 
electrophoresed on an ABI 377 DNA sequencer and data 
analysed using Sequence Navigator software. 

A further possible screening approach employs a PTT 
30 assay in which fragments are amplified with primers that 
contain the consensus Kozak initiation sequences and a T7 
RNA polymerase promoter. These extra sequences are 
incorporated into the 5* primer such that they are in frame 
with the native coding sequence of the fragment being 
35 analysed. These PCR products are introduced into a coupled 



wo 98/30902 PCT/GB98/00095 

56 

transcription/translation system. This reaction allows the 
production of RNA from the fragment and translation of this 
RNA into a protein fragment. PGR products from controls 
make a protein product of a wild type size relative to the 
5 size of the fragment being analysed. If the PGR product 
analysed has a frame-shift or nonsense mutation, the assay 
will yield a truncated protein product relative to controls. 
The size of the truncated product is related to the position 
of the mutation, and the relative region of the gene from 
10 this patient may be sequenced to identify the truncating 
mutation - 

An alternative or supplement t.o looking for the 
presence of variant sequences in a test sample is to look 
for the presence of the normal sequence, e.g. using a 
15 suitably specific oligonucleotide probe or primer. 

Approaches which rely on hybridisation between a probe 
and test nucleic acid and subsequent detection of a mismatch 
may be employed. Under appropriate conditions (temperature, 
pH etc.), an oligonucleotide probe will hybridise with a 
20 sequence which is not entirely complementary. The degree of 
base-pairing between the two molecules will be sufficient 
for them to anneal despite a mis-match. Various approaches 
are well known in the art for detecting the presence of a 
mis-match between two annealing nucleic acid molecules. 
25 For instance, RN' ase A cleaves at the site of a mis- 

match. Cleavage can be detected by electrophoresing test 
nucleic acid to which the relevant probe or probe has 
annealed and looking for smaller molecules (i.e. molecules 
with higher electrophoretic mobility) than the full length 
30 probe/test hybrid. Other approaches rely on the use of 
enzymes such as resolvases or endonucleases . 

Thus, an oligonucleotide probe that has the sequence of 
a region of the normal gene (either sense or anti-sense 
strand) in which at least one mutation associated with, 
35 e.g., cancer susceptibility is known to occur, may be 
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annealed to test nucleic acid and the presence or absence of 
a mis-match determined. Detection of the presence of a mis- 
match may indicate the presence in the test nucleic acid of 
a mutation associated with, e.g., cancer susceptibility. On 
5 the other hand, an oligonucleotide probe that has the 
sequence of a region of the gene including a mutation 
associated with, e.g., cancer susceptibility may be annealed 
to test nucleic acid and the presence or absence of a mis- 
match determined. The presence of a mis-match may indicate 

10 that the nucleic acid in the test sample has the normal 

sequence. In either case, a battery of probes to different 
regions of the gene may be employed. Indeed, probes may be 
included with probes or other materials for other genes for 
stream-lined testing. 

15 The presence of differences in sequence of nucleic acid 

molecules may be detected by means of restriction enzyme 
digestion, such as in a method of DNA fingerprinting where 
the restriction pattern produced when one or more 
restriction enzymes are used to cut a sample of nucleic acid 
20 is compared with the pattern obtained when a sample 
containing the normal gene or a variant or allele is 
digested with the same enzyme or enzymes. 

A test sample of nucleic acid may be provided for 
example by extracting nucleic acid from cells, e.g. in 
25 saliva or preferably blood, or for pre-natal testing from 
the amnion, placenta or foetus itself. 

Nucleic acid according to the present invention, such 
as a full-length coding sequence or oligonucleotide probe or 

30 primer, may be provided as part of a kit, e.g. in a suitable 
container such as a vial in which the contents are protected 
from the external environment. The kit may include 
instructions for use of the nucleic acid, e.g. in PGR and/or 
a method for determining the presence of nucleic acid of 

35 interest in a test sample. A kit wherein the nucleic acid 
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is intended for use in PGR may include one or more other 
reagents required for the reaction, such as polymerase, 
nucleosides, buffer solution etc. The nucleic acid may be 
labelled. A kiit for use in determining the presence or 
5 absence of nucleic acid of interest may include one or more 
articles and/or reagents for performance of the method, such 
as means for providing the test sample itself, e.g. a swab 
for removing cells from the buccal cavity or a syringe for 
removing a blood sample (such components generally being 

10 sterile) . In a further aspect, the present invention 
provides an apparatus for screening for XRCC4 and/or DNA 
ligase IV nucleic acid, the apparatus comprising storage 
means including the relevant gene nucleic acid sequence, or 
a fragment thereof, the stored sequence being used to 

15 compare the sequence of the test nucleic acid to determine 
the presence of mutations. 

There are various methods for determining the presence 
or absence in a test sample of a particular polypeptide, 

20 such as a polypeptide including a fragment of XRCC4 or DNA 
ligase IV corresponding to a particular region involved in 
interaction with counterpart DNA ligase IV or XRCC4 , as the 
case may be, involved in interaction with one or more other 
proteins or components of a DNA repair pathway, or having a 

25 particular biological activity, such as DNA ligase enzymatic 
activity. 

A sample may be tested for the presence of a binding 
partner for a specific binding member such as an antibody 
(or mixture of antibodies) , specific for one or more 
30 particular variants of the polypeptide, i.e. wild-type or a 
mutant, variant or allele thereof. 

A sample may be tested for the presence of a binding 
partner for a specific binding member such as an antibody 
(or mixture of antibodies), specific for the polypeptide. 
35 In such cases, the sample may be tested by being 
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contacted with a specific binding member such as an antibody 
under appropriate conditions for specific binding, before 
binding is determined, for instance using a reporter system 
as discussed. Where a panel of antibodies is used, 
5 different reporting labels may be employed for each antibody 
so that binding of each can be determined. 

A specific binding member such as an antibody may be 
used to isolate and/or purify its binding partner 
polypeptide from a test sample, to allow for sequence and/or 
10 biochemical analysis of the polypeptide to determine whether 
it has the sequence and/or properties of the polypeptide of 
interest, or if it is a mutant or variant form. Amino acid 
sequence is routine in the art using automated sequencing 
machines . 

15 

Protein may be detected using Western blots, and also 
Far-Western blots in which a non-antibody protein is used. 
For instance, XRCC4 could be used to determine the presence 
of DNA ligase IV, and vice versa. Immunoprecipitation, 
20 radio-immunoassay, ELISA and other standard approaches in 
the art may be employed, using antibodies and other 
appropriate specific binding agents. 

Various further aspects and embodiments of the present 
25 invention will be apparent to those skilled in the art in 
view of the present disclosure. Certain aspects and 
embodiments of the invention will now be illustrated by way 
of example and with reference to the figures discussed 
already above. 

30 

EXAMPLE 1 

DETERMINATION OF BIOLOGICAL ACTIVITY OF XRCC4 



Generation of antlsera that recognise XRCC4 
35 With the aim of gaining insights into the mechanism of 
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XRCC4 action, it was decided to try to characterise the 
human protein biochemically. Towards this end, full-length 
human XRCC4 and the C-terminal region of XRCC4 comprising 
residues 201-344 were expressed in Escherichia coli as hexa- 
5 histidine-tagged proteins. After purification to 
homogeneity, each antigen was then used to raise polyclonal 
antisera in rabbits . 

In the course of these studies, we observed that 
recombinant full-length XRCC4 runs anomolously upon SDS- 
LO PAGE, with an apparent molecular mass of -55 kDa, which is 
considerably larger than the predicted molecular weight of 
38 kDa. Untagged and His-tagged versions of XRCC4 were 
found to behave similarly. Although the reason for this is 
currently unclear, this might reflect the fact that XRCC4 
.5 contains an unusually large proportion of glutamic acid 
amino acid residues, increasing the negative charge of the 
protein. One possible result of this would be a net 
decrease in the amount of SDS bound to the protein which 
would decrease the mobility of XRCC4 upon SDS-PAGE analysis. 
0 Western blot analyses revealed that each of the anti- 

XRCC4 antisera raised was capable of recognising less than 1 
ng of recombinant XRCC4 protein. To establish whether these 
antisera are capable of detecting endogenous XRCC4 in 
mammalian cell lysates, crude HeLa cell nuclear extracts 
5 were subjected to SDS-PAGE followed by Western iromunoblot 
analysis . 

Importantly, each antiserum, but none of the pre-immune 
sera, was found to recognise a HeLa cell protein of 55-60 
kDa, which is in good agreement with the size of recombinant 

0 XRCC4 . In addition, each immune serum also detects several 
other polypeptides weakly. Although the identities of these 
are not established, some may correspond to alternative 
forms of XRCC4 or its proteolytic degradation products. For 
instance, one band in particular is likely to represent a N- 

5 terminal XRCC4 proteolytic product because it is recognised 
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by all sera raised against the full-length protein but not 
by serum SJ5 that was raised against the XRCC4 C-terminal 
region. Interestingly, despite the high degree of sequence 
conservation between XRCC4 in rodents and humans (Li et ai., 
5 1995), we have been unable to detect XRCC4 in extracts of 
mouse or hamster cells by direct Western blotting using 
these antibodies. This could in part reflect low 
immunological cross-reactivity between the human and rodent 
proteins. However, given the evolutionary conservation of 

10 XRCC4, the model that we currently favour is that, as is the 
case for other DNA DSB repair factors, such as Ku and DNA- 
PKcs (Blunt et ai., 1995; Finnie et ai., 1995; Danska et 
al.r 1996), XRCC4 is expressed at much lower levels in 
rodent cells than in human cells (also, see below) . 

15 To enhance further the specificity of anti-XRCC4 

antiserum SJ4B, this was subjected to immuno-af f inity 
chromatography using XRCC4 that had been attached covalently 
to Sepharose beads. Significantly, whereas the crude serum 
recognises a number of polypeptides in HeLa whole cell 

20 extracts in addition to full-length XRCC4 , much of the 

reactivity towards the other proteins is recovered in the 
flow-through fractions, resulting in the affinity-purified 
antibody material (eluate) having improved specificity and 
selectivity as compared to the unf ractionated serum. 

25 

XRCC4 is a nuclear phosphoprotein and serves as an effective 
substrate for DNA-PK in vitro 

As a first step towards establishing the biochemical 
function of XRCC4 , we decided to try to determine its sub- 

30 cellular localisation. Nuclear and cytosolic fractions were 
prepared from HeLa cells and were subjected to Western blot 
analysis using affinity-purified XRCC4 antibody SJ4B, The 
integrity of the fractions was established by also probing 
with antisera against Spl which is located predominantly in 

35 the nuclear fraction. 
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Notably, these studies revealed that XRCC4 is present 
in the nuclear extract, with low amounts being detectable in 
the cytosolic fraction. These data therefore reveal that 
XRCC4 is a nuclear protein and are consistent with models in 
5 which XRCC4 serves as part of a DNA DSB repair apparatus, 
for instance as illustrated in Figure 5. 

In this model (which is proposed without in any way 
limiting the nature or scope of any aspect of the present 
invention or embodiment thereof), Ku binds to the free DNA 

10 ends and recruits DNA-PKcs, activating the kinase catalytic 
function of the latter. A DNA ligase IV/XRCC4 complex is 
then recruited to the DNA DSB. Ona or more additional 
components may be involved: some possibilities are indicated 
by means of question marks. Active DNA-PK may also trigger 

15 DNA damage signalling events or may phosphorylate other DNA 
DSB repair components, such as XRCC4 as has been 
demonstrated by the inventors, thus regulating their 
activity. The stoichiometry of the XRCC4-DNA ligase IV 
complex is for the purpose of illustration only. XRCC4 may 

20 interact with DNA ligase IV anywhere within residues 550-884 
of human DNA ligase IV, e.g. at or between the BRCT domains. 

During the course of the above studies, we observed 
that HeLa XRCC4 reproducibly migrates more slowly than 

25 recombinant XRCC4 on SDS-PAGE, suggesting that human XRCC4 
is modified post-translationally , To determine whether this 
reflects XRCC4 phosphorylation, HeLa nuclear extract was 
either mock-treated, treated with K protein phosphatase, or 
was treated with \ phosphatase in the presence of 

30 phosphatase inhibitors. 

Significantly, Western analysis of these samples 
j:evealed that X phosphatase increases the SDS-PAGE mobility 
of HeLa XRCC4 so that it is now equivalent to that of the 
recombinant protein, whereas this effect is abrogated by 

35 phosphatase inhibitors. These data therefore reveal that 
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XRCC4 is phosphorylated to high stoichiometry in HeLa cell 
extracts and suggest that this modification is used to 
modulate XRCC4 activity in vivo. 

In light of this, and because cells deficient in XRCC4 
5 have very similar phenotypes to those defective in 
components of DNA-PK, we tested whether DNA-PK is able to 
phosphorylate XRCC4 in vitro. XRCC4 indeed serves as an 
effective substrate for DNA-dependent phosphorylation by 
DNA-PK, so DNA-PK may control XRCC4 activity within the 
10 cell. 

Endogenous XRCC4 appears to be complexed with another 
protein (s) 

There are various ways in which XRCC4 might function in 

15 DNA DSB repair and V(D)J recombination. 

One possibility the inventors considered is that it 
interacts directly with DNA and plays a role in repairing 
DNA damage or signalling its presence to the cell. However, 
we have been unable to detect binding of recombinant XRCC4 

20 to various DNA species in electrophoretic mobility shift 
assays. Furthermore, when HeLa nuclear extracts are passed 
through DNA-agarose columns under salt concentrations in 
which many DNA binding proteins are retained, the majority 
of endogenous XRCC4 flows through. 

25 These data therefore argue that XRCC4 does not bind 

avidly to DNA. 

Another possible role for XRCC4 considered by the 
inventors is for it to interact with another component of 
the DNA DSB repair apparatus. As an approach to address 

30 this idea, we investigated the biochemical fractionation of 
XRCC4 and other known and potential DNA DSB repair factors 
upon gel-filtration chromatography on Superose-6. To 
disrupt possible non-specific protein-protein associations, 
such experiments were performed under stringent conditions 

35 of 1 M NaCl. 



wo 98/30902 PCT/GB98/00095 

64 

These studies revealed that recombinant untagged XRCC4 
elutes in a manner consistent with a mass of just over 66 
kDa, which is larger than the predicted XRCC4 monomer 
molecular weight (Li at ai., 1995) and its apparent mass as 
5 determined by SDS-PAGE. This suggests either that XRCC4 is 
a monomeric protein with shape characteristics causing it to 
behave anomolously upon gel-filtration, or exists in 
solution as a multimer, most likely a dimer. 

10 Most significantly, gel-filtration analysis of HeLa 

nuclear extract in the presence of 1 M NaCl reveals that 
endogenous XRCC4 fractionates in a manner consistent with a 
molecular mass of around 200 kDa, which is markedly higher 
than that for recombinant XRCC4 . These data therefore 
15 suggest strongly that HeLa XRCC4 is associated with another 
protein (s). We took the same set of gel-filtration 
fractions tested above for XRCC4 and examined them for the 
presence of Ku, DNA-PK^s. and DNA ligases I, III and IV. 
Significantly, although some overlap was evident in 
20 each case, the XRCC4 elution profile did not parallel those 
exhibited for DNA ligase I, Ku or DNA-PK^,^. Thus, ligase I 
peaked at -150 kDa which is slightly larger than the 
predicted monomer molecular weight of 1-2 kOA, DNA-PK^^ (4 65 
kOA) eluted at around 200 kDa which may indicate that the 
25 tertiary structure of DNA-PK is disrupted under these 

conditions, and Ku elution peaked at -150 kDa, consistent 
with the predicted size of a Ku70/Ku80 heterodimer. 

In marked contrast, the elution profile of XRCC4 was 
found to be virtually identical to those of DNA ligases III 
30 and IV. These results therefore raised the possibility that 
XRCC4 exists in stable association with either DNA ligase 
III or IV. 



HeLa cell XRCC4 co-lmmunopreclpitates with DNA ligase IV 
35 To further test for possible interactions between XRCC4 
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and the factors described above, we iiranunoprecipitated XRCC4 
from its peak gel-filtration fractions in the presence of 1 
M NaCl and 50 pg/ml ethidium bromide (to abolish non- 
specific interaction mediated via DNA) , and tested the 
5 resulting precipitated material for the presence of Ku, DNA- 
PK ligases I, III, and IV. Significantly, Western 

immunobiot analyses revealed that DNA-PK^^g, Ku, and DNA 
ligase I that were present in the XRCC4 fractions did not 
co-immunoprecipitate with XRCC4, confirming that XRCC4 does 
10 not interact stably with any of these factors under these 
assay conditions. Note, however, that as discussed 
elsewhere herein, the inventors have established that DNA- 
PKcs/Ku is able to interact with XRCC4 under other 
conditions. They have shown also that it phosphorylates 
15 XRCC4, which of course requires some interaction between the 
proteins. This is confirmed by Leber et ai. , "The XRCC4 
gene product is a target for and interacts with the DNA- 
dependent protein kinase" J. Biol. Chem (1998) Jan. 16 273 
(3) , 1794 - according to information available from the 
20 World Wide Web on 12 January 1998. 

To assay for possible interactions between XRCC4 and a 
DNA ligase enzyme, we employed the fact that mammalian DNA 
ligases form covalently-linked adenylate complexes 
(Tomkinson et ai., 1991; Wei et ai . , 1995; Danska et ai . , 
25 1996; Robins and Lindahl, 1996) . When the XRCC4-containing 
gel-filtration fraction was incubated with [a-^^P]-ATP and 
was then examined by SDS-PAGE followed by autoradiography, 
adenylated proteins of approximately 120 kDa and 100 kDa 
were detected, which correspond to DNA ligase I and a 
30 mixture of DNA ligases III and IV, respectively. 

To see whether any of these ligases associate with 
XRCC4, uniabelled extract was incubated with pre-immune or 
anti-XRCC4 antisera in the presence of 1 M NaCl then, after 
stringent washing, the immunoprecipitated material was 
35 incubated with [a-^^P]-ATP and tested for radioactively- 
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labelled adenylated proteins. 

Significantly, these studies revealed that an 
adenylated protein species of -100 kDa, corresponding to DNA 
ligase III and/or IV is immunoprecipitated efficiently by 
5 the affinity purified XRCC4 antiserum but not by pre-immune 
sera. By contrast, the adenylated species corresponding to 
DNA ligase I is not recovered. Importantly, and consistent 
with the fact that the adenylate moiety of adenylated-DNA 
ligase complexes is discharged in the presence of ligatable 
10 polynucleotide substrates, the radiolabel associated with 
the XRCC4-precipitated material is lost upon incubation in 
the presence of DNA that has been nicked by treatment with 
DNase I . 

To rule out the possibility that the immunoprecipitated 
15 ligase was being recognised directly by the anti-XRCC4 
antiserum, we performed parallel immunoprecipitation 
reactions on extracts derived from the hamster cell lines Kl 
and XR-1, which contain and lack XRCC4 protein, 
respectively. Importantly, the -100 kDa adenylated ligase 
20 species is recovered from Kl extracts but not from XR-1 
extracts. These data therefore reveal that the ligase is 
not recognised by the antiserum directly and, instead, is 
immunoprecipitated via its association with XRCC4 . 

25 Taken together, the above results reveal that XRCC4 

forms a tight salt-stable interaction with DNA ligase III 
and/or DNA ligase IV. To establish which of these two 
enzymes is associated with XRCC4 , we took advantage of the 
fact that ligases III and IV have different abilities to 

30 join single-strand breaks in polynucleotide substrates 

containing one DNA strand and one RNA strand. Thus, whereas 
DNA ligase III can catalyse joining in both 

oligo(rA)«poly (dT) and oligo (dT) -poly ( rA) substrates, ligase 
IV is only able to mediate joining of the latter (Robins, 
35 1996) . 
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In light of this, we performed adenylation assays on 
material immunoprecipitated with XRCC4 and then incubated 
the labelled immunoprecipitates with either 
oligo(dT)»poly (rA) or oligo (rA) -poly (dT) . Notably, only 
5 oligo (dT) -poly (rA) resulted in dissociation of the adenylate 
group from the ligase that is immunoprecipitated with XRCC4 . 
These results therefore suggest strongly that XRCC4 
interacts tightly and specifically with DNA ligase IV but 
not with DNA ligase III. 

10 

XRCC4 and ligase IV co-purify extensively 

To confirm the interaction between XRCC4 and DNA ligase 
IV, and to gain insight into what proportion of the two 
proteins exists in this complex, we purified DNA ligase IV 
15 using established protocols (Robins, 1996) and tested for 
the presence of ligase IV and XRCC4 by quantitative Western 
immunoblot analyses at each chromatographic stage. 

Fractions collected from the chromatographic columns 
were analysed on SDS-polyacrylamide gels and specific 
20 binding of antibodies was detected by immunoblots. The 

amount of specified protein in each fraction was quantitated 
from densitometric scans of the gels. The amount of each 
protein in analysed fractions as a proportion of its total 
amount was plotted for the samples separated by gel 
25 chromatography fractionation (Figure lA) , followed by a Mono 
S column (Figure IB) . 

As demonstrated previously, we observed that DNA 
ligases III and IV co-elute during gel filtration 
chromatography (Figure lA) but are resolved from one another 
30 by chromatography on Mono-S (Figure IB) . Significantly, 

XRCC4 tracks with ligase IV throughout these procedures but, 
in contrast, becomes separated from DNA ligase III at the 
Mono-S chromatography step (Figures lA and IB) . 
Furthermore, XRCC4 is present even in more highly purified 
35 samples of DNA ligase IV generated via subsequent 
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chromatography on Mono Q, and iiranunoblot analyses of near 
homogenous ligase IV preparations that we had prepared 
previously demonstrates the existence of XRCC4 . Samples 
further purified on a Mono Q column (Robins, 1996) were 
5 analysed on a 10% SDS-polyacrylamide gel by immunoblots 
testing for the presence of XRCC4 or DNA ligase IV. 
Molecular sizes were estimated from the migration of 
Kaleidoscope pre~stained markers. 

In additional studies, we have also observed that XRCC4 
10 and DNA ligase IV co-purify on phenyl-Sepharose . Indeed, 

short of incubation with harsh ionic detergents, we have yet 
to find a procedure to separate these two proteins. 

Interestingly, the XRCC4 that co-purifies with ligase 
IV corresponds to the phosphorylated form of the protein as 
15 evidenced by its SDS-PAGE mobility and by the fact that this 
mobility is increased by phosphatase treatment. 

Finally, it is particularly noteworthy that XRCC4 and 
ligase IV co-purify almost quantitatively with one another, 
and that no free pools of either factor are evident {for 
20 example, see Figures lA and IB) . This therefore suggests 
that XRCC4 and DNA ligase IV are present at similar levels 
in the cell and that virtually all of each polypeptide 
exists in a complex with its partner. 

25 XRCC4 interacts with the C-terminal portion of DNA ligase IV 
that comprises two BRCT homology domains 

To gain insights into the basis for the highly specific 
binding of DNA ligase IV to XRCC4 , we decided to try to 
determine which region (s) of ligase IV are involved in this 

30 interaction. 

DNA ligase I, III and IV display high levels of 
sequence similarity to one another within a section that has 
been defined as the core ligase catalytic domain (Wei et 
ai., 1995). In addition, each ligase possesses discrete N- 

35 and/or C-terminal extensions that have been proposed to 
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confer unique properties on the three enzymes. 
Significantly, although the C-terminal extensions of DNA 
ligases III and IV show very little homology with one 
another at the primary sequence level, they possess one and 
5 two copies, respectively, of the recently identified BRCT 
homology domain (Koonin et ai., 1996; Callebaut and Mornon, 
1997) ; see Discussion below) . 

With the aim of finding which region (s) of ligase IV 

10 interacts with XRCC4 , we divided the ligase IV polypeptide 
into three portions - an N-terminal region (corresponding to 
amino acid residues 1-198) that exhibits homology with 
ligase I and III, a central region {residues 199-549) that 
shows highest levels of homology with ligase I and III and 

15 which contains the ligase catalytic site, and a C-terminal 
region (residues 550-844) that contains the two BRCT 
homology domains. After transcribing and translating the 
three regions separately In vitro, these were tested for an 
ability to bind to Sepharose beads or to Sepharose beads 

20 containing covalently-attached XRCC4 . Samples of LigIV(l- 
198), Lig IV(199-549), Lig IV(550-844) and luciferase were 
applied to XRCC4-Sepharose beads or negative control beads 
(ON) and unbound proteins collected (FT) . After washes with 
either 0.1 M and 1.0 M NaCl, bound proteins were eluted with 

25 gel loading buffer (SDS) . After SDS-PAGE, the 

[•^^S]methionine-labelled fragments were detected by 
autoradiography. 

The N-terminal and central fragments of DNA ligase IV 
fail to bind detectably to the XRCC4-beads, as is the case 

30 for the luciferase protein that was employed as a control. 
In marked contrast, the ligase IV C-terminal fragment is 
retained almost quantitatively on the XRCC4-beads but not on 
control beads lacking XRCC4. Moreover, the binding of the 
C-terminal portion of ligase IV to XRCC4 appears to be very 

35 strong, as evidenced by the fact that the ligase IV C- 
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terminal region is not eluted by washing at 1 M NaCl and is 
only recovered following addition of the ionic detergent 
SDS. 

Finally, to further address the specificity of the 
5 above interaction, we assessed whether XRCC4-containing 
beads could be used^to purify the C-terminal region of 
ligase IV from crude bacterial lysates. To do this, an 
unf ractionated extract of E. coll expressing this region 
{LIGIV (550-844)) at fairly low levels was incubated with 
10 XRCC4-containing Sepharose beads and unbound proteins 

collected. Bound material was then eluted with step-wise 
increases in salt concentrations, followed by a final 
elution in the presence of gel loading buffer, SDS. 

Strikingly, as shown by total Coomassie-blue protein 
15 staining of an SDS-polyacrylamide gel containing these 

fractions, this procedure results in the C-terminal region 
of ligase IV being purified to virtual homogeneity in a 
single step. The identity of this polypeptide as the ligase 
IV C-terminus was confirmed by Western blot analyses, and 
20 this protein was not retained by Sepharose beads alone. 

Taken together, these results attest to the extreme 
strength and specificity of the interaction between the 
ligase IV C-terminal region and XRCC4. 

5 

MATERIALS AND METHODS 
Enzymes r antibodies and DNA 

pET-30b and pQE-30 were obtained from Novagen and 
Qiagen, respectively. Purified mouse monoclonal MRGS.His 

0 antibody (Qiagen) that recognises pQE-30-derived His-tagged 
proteins was used as per manufacturer's instructions. Ku70, 
Ku80, and DNA-PK^s antisera were used as described previously 
(Hartley et aJ., 1995; Finnie et ai., 1996). Antibodies 
against ligase I (TL5), ligase III (TL25) and ligase IV 

5 (TL18) were used as described by (Lasko et ai., 1990; Robins 
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and Lindahl, 1996) . Antigen-antibody complexes were 
detected by enhanced chemi-luminescence (Amersham) according 
to the manufacturer's instructions. HeLa nuclear extract 
was obtained from Computer Cell Culture Centre, Mons, 
5 Belgium. All plasmid constructs were verified by automated 
DNA sequencing. 

Expression and purification of XRCC4 derivatives 

To generate recombinant untagged XRCC4, the full-length 

10 XRCC4 coding region was amplified from pBlueScript 

containing the human XRCC4 gene by PGR and inserted into 
pET-30a (Novagen) digested with Nde 1/Sal I such that the N- 
terminal His/S tags are not present and so that the KRCC4 
stop codon prevents the addition of the C-terminal His-tag. 

15 For protein expression, BL21{DE3) cells harbouring the 

resulting plasmid (pET30XRCC4) were grown in 500 ml cultures 
of LB/kanamycin (50 pg/ml) to mid-log prior to induction 
with 0.4 mM IPTG for 4 h at 37°C. After lysing the 
collected cell pellet by sonication, 30.2 g of ammonium 

20 sulphate was added slowly per 100 ml of supernatant and 
incubated with stirring at 4°C for 30 min. After 
centrifugation, the pellet was resuspended in TED (50 mM 
TriS-HCl pH 7.5, 2 mM DTT and 1 mM EDTA) and dialysed 
extensively against TED. Protein was then loaded onto a 

25 heparin-Sepharose column pre-equilibrated with TED and 
protein was eluted with a linear gradient of 0 to 0.6 M 
NaCl. Fractions containing XRCC4 were pooled and dialysed 
against TED containing 1.0 M ammonium sulphate then were 
loaded onto a pre-equilibrated phenyl-Sepharose column. 

30 Proteins were eluted with a 100 ml linear gradient of 1.0 
to 0 M (NH4)2S04. Fractions containing XRCC4, eluting at 
-0.2 M {m^) 2S0^, were pooled and dialysed against 50 mM 
Tris.HCl pH 7.5, 2 mM DTT, 1 mM EDTA and 10% (w/v) glycerol, 
and stored at -80 '^C . 
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Antl-XRCC4 antibody production and purification 

Regions of the XRCC4 gene were amplified by PGR from 
pBlueScript containing the human XRCC4 gene then were 
inserted in-frame downstream of the hexa-histidine (His) tag 
5 of pQE-30 (Qiagen, USA) and were expressed and purified 
according to the manufacturer's instructions from the 
soluble fraction of bacterial lysates. Antibodies against 
the soluble recombinant proteins were raised in rabbits 
using standard procedures (Harlow and Lane, 1988) and are 

10 available commercially from Serotec, UK. Western immunoblot 
analyses were performed as described previously (Harlow and 
Lane, 1988) and blots were developed by enhanced chemi- 
luminescence (Amersham) . Recombinant His-tagged full-length 
XRCC4 was attached to Sulfolink Coupling Gel (Pierce, USA) 

15 and was used to immuno-af f inity purify anti-XRCC4 antibodies 
from crude SJ4 serum as described previously (Lakin et ai., 
1996) . 



Phosphatase- treatment of HeLa cell extracts 
20 To analyse for XRCC4 phosphorylation, HeLa nuclear 

extract (50 pg) was treated with X protein phosphatase (New 
England Biolabs) in the presence of 2 mM MnCl2 and incubated 
for 30 min at 30°C prior to SDS-PAGE and Western blotting. 

25 Co~inununopreclpltatlons and llgase adenylylatlon assays 

XRCC4 was immunoprecipitated from HeLa nuclear extract 
using polyclonal anti-XRCC4 and a pre-immune control. 
Specifically HeLa nuclear extract was dialysed into Buffer 
D* (20 mM HEPES-KOH, 20% (w/v) glycerol, 50 mM KCl, 2 mM 

30 MgClj, 0.2 mM EDTA, 1 mM DTT, 0 . 5 mM PMSF, 1 mM sodium 
metabisulphite and 0.1% NP-40) and then incubated with 
either pre-immune or immune serum for 1 h at 4**C in the 
presence of 50 pg/ml ethidium bromide to disrupt non- 
specific interactions (Lai and Herr, 1992) . Immune 

35 complexes were bound to protein A Sepharose beads 
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(Pharmacia), followed by extensive washing with Buffer D* 
containing 0.15-1 M NaCl . The protein A Sepharose beads 
were finally washed in Buffer D* containing 0,15 M NaCl 
prior to analysis. The samples were then tested for the 
5 ability to form DNA ligase-adenylated complexes as described 
previously (Robins and Lindahl, 1996) . The polynucleotide 
substrates oligodT«polyrA and oligorA-polydT were prepared as 
described (Tomkinson et al., 1991). The reactivity of the 
enzyme-adenylate intermediates formed was examined by adding 
10 0.8 ]^g of unlabelled oligodT'polyrA or oiigorA«polydT for 1 
hr at 30°C. The reactions were stopped by the addition of 
SDS sample buffer and adenylylated proteins detected by 
autoradiography following SDS-PAGE. 

15 Gel- filtration chromatography 

Total HeLa nuclear extract (6 mg protein) was dialysed 
extensively against Buffer A (50 mM Tris-HCl pH 7.5, 1 mM 
EDTA, 0.5 mM DTT, 10% (w/v) glycerol) containing 1 M NaCl. 
The protein was then loaded onto a Superose 6 (Pharmacia) 

20 column (60 x 1.5 cm) pre-equilibrated with Buffer A 

containing 1 M NaCl. On an identical gel-filtration run, 
0.2 mg of pure untagged recombinant XRCC4 was analysed in 
buffer A containing 1 M NaCl. 

25 Purification of DNA ligase IV from HeLa cells 

DNA ligase IV was purified from HeLa cells as described 
previously (Robins, 1996) . Fractions collected from each of 
the columns were analysed by immunoblots with antibodies 
specific for DNA ligases III and IV and XRCC4 . 

30 

Expression of recombinant ligase IV derivatives 

For generation of recombinant ligase IV derivatives, 
fragments of the human ligase IV gene coding region were 
amplified by PCR from reverse transcribed HeLa RNA. Each 
35 PCR product included a Bam HI site at the 5' end and a stop 
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codon followed by aSal 1 site at the 3' end, and after 
digestion were ligated into pET30b digested with Bam Hl/Sal 
I. The 550-844 fragment of ligase IV was also cloned into 
pQE-30 and the resultant clone was expressed in E. coll 
5 M15{Rep4). For In vitro transcription and translation of 
ligase IV fragments, 1 pg of pET30LigIV ( 1-198 ) , 
pET30LigIV(199-549) , pET30LigIV (550-844 ) or a luciferase 
control (Promega) were transcribed In vitro and translated 
using the TnT rabbit reticulocyte lysate kit (Promega) 

10 according to the manufacturer's instructions. Resulting N- 
terminally His-tagged ligase IV products were purified by 
Ni^''"-NTA agarose chromatography. 

Briefly/ a 100 pi bed volume of Qiagen Ni^"*'-NTA agarose 
was pre-equilibrated in wash buffer (50 mM Tris-HCl pH 7.5, 

15 20 mM imidazole, 10% (w/v) glycerol and 0.5 M NaCl) prior to 
the addition of 20 \il of the crude [^^S] methionine-labelled 
In vitro translated ligase fragment. Unbound proteins were 
removed after low-speed centrif ugation and the resin was 
washed 3 times with wash buffer to remove non-specif ically 

20 bound proteins. Finally, ligase IV proteins were eluted 
with 100 pi of elution buffer consisting of 50 mM Tris-HCl 
pH 7.5, 100 mM imidazole and 10% (w/v) glycerol. 

Interaction assays between recombinant XRCC4 and ligase IV 

25 derivatives 

Full-length XRCC4 was immobilised on Sepharose-4B gel 
beads (Pharmacia) using the cyanogen bromide method 
according to the manufacturer's instructions. As a negative 
control, coupling was also performed without XRCC4 protein. 

30 A 30 pi bed volume of beads (with and without XRCC4) was 

pre-equilibrated with Binding Buffer (50 mM Tris-HCl pH 7.5, 
2 mM DTT, 1 mM EDTA, 10% (w/v) glycerol, 0.1% NP-40 and 0.36 
mg/ml BSA) before the addition of the His-tag purified In 
vitro translated ligase. products or luciferase. Unbound 
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material was collected after centrif ugation and the beads 
were washed with Binding Buffer containing 0.1 M NaCl and 
1.0 M NaCl. The beads were resuspended in gel loading 
buffer and boiled in preparation for analysis by SDS-PAGE. 

5 Finally SDS-PAGE gels were fixed in 10% acetic acid for 30 
min and dried, and labelled proteins were detected by 
autoradiography. The ability of the recombinant C-terminal 
fragment of ligase IV (residues 550-844) to bind XRCC4- 
Sepharose beads was tested as above, except that analysis 

0 was by Coomassie-brilliant blue staining and immunoblott ing 
with the anti-ligase IV antibody. 

EXAMPLE 2 

DETERMINATION OF BIOLOGICAL ACTIVITY OF DNA LIGASE IV 

5 

Identification of a second^ hitherto uncharacterised^ yeast 
DNA ligase 

Using a consensus sequence within the core catalytic 
domain of all published DNA ligases the present inventors 

0 searched through the recently fully sequenced S. cerevisiae 
genome (Goffeau et al., 1996). 

In addition to detecting CDC9f which encodes DNA ligase 
I, these searches identified an ORF (YOR005c) present on 
chromosome XV as a highly significant hit. 

5 This ORF encodes a 944 amino acid residue polypeptide 

of predicted molecular mass of 109 kDa that exhibits 
extensive similarity (24 % identity; 43 % similarity) in its 
central region to the "core" ligase conserved domain of DNA 
ligase I (Figure 2). 

0 Phylogenetic analyses of protein alignments over this 

region reveal that YOROOSc is considerably more related to 
DNA ligases of eukaryotes and eukaryotic viruses than to 
those of prokaryotes, and in particular, is most closely 
related to human ligase IV. A phenogram was generated using 

5 the PHYLIP package with the aligned "core" conserved 
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sequence of each protein as designated in Figure 2 using the 
UPGMA method. Accession numbers are as follows: A. thaliana 
I {X97924); C. albicans CX95001); C. elegans I (Z73970) ; 
Fowlpox virus (U00761) ; H. sapiens I (Iyi36067) , III (X84740) 
5 and IV {X83441) ; M. musculus I (U04674); Rabbit fibroma 
virus (Z29716); S. cerevislae 1 (X03246) , IV (Z74913); S. 
pombe I (X05107); T7 bacteriophage (P00969) ; Vaccinia virus 
(X16512); X. laevls I CL43496) . 

Consistent with this, YOROOSc shares an N-terminal 
10 extension with the mammalian enzymes that is lacking in 
prokaryotic DNA ligases. Furthermore, it possesses an 
additional C-terminal extension that is homologous 
throughout its length to that of mammalian ligase IV. We 
thus conclude that YOROOSc encodes a homologue of mammalian 
15 DNA ligase IV and designate this locus LIG4 . 

Disruption of LIG4 does not lead to marked hypersensitivity 
to a variety of DNA-damaglng agents 

To study LIG4 function, we inactivated this gene in the 

20 haploid yeast strain W303a by a one-step gene disruption. 

Notably, resulting llg4 mutants do not have readily 
observable growth defects when propagated at temperatures 
ranging from 18°C to 37°C. This contrasts markedly with 
CDC9, the gene encoding yeast ligase I, whose disruption 

25 results in lethality due to an inability to progress through 
S-phase (Johnston and Nasmyth, 1978). It is thus concluded 
that LIG4 does not play an essential role in DNA 
replication, and that yeast ligase I is the only DNA ligase 
required for this process. 

30 Next, we tested whether llg4 mutant yeast are defective 

in any of the predominant DNA repair pathways by assessing 
their sensitivity to killing by various DNA damaging agents. 
Notably, llg4 mutant strains are not hypersensitive to DNA 
damage induced by exposure to ultraviolet radiation, showing 

35 that it is not essential in nucleotide excision repair. In 
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addition, strains disrupted for LIG4 are not hypersensitive 
a range of concentrations of the radiomimetic drug, methyl 
methanesulf onate (MMS) in the growth medium. Finally, lig4 
mutant yeasts also do not display significantly elevated 
5 sensitivity to killing by ionising radiation at a range of 
doses (0 - 45 kRad) . Unlike rad52 mutants, llg4 mutant 
yeast are as resistant to ionising radiation as parental 
strains: llg4 mutants were not significantly more sensitive 
even at high doses (up to 4 5 kRad) {Figure 3A) . 

10 Since radiation-induced DNA double-strand-breaks (DSBs) 

are repaired primarily by homologous recombination in S. 
cerevisiae, these data suggest that LIG4 is not essential 
for the latter process. Consistent with this, we have found 
that the efficiency of homologous recombination-mediated 

15 targeted integration into various loci in the yeast genome 
is indistinguishable between wild-type and lig4 mutant 
strains . 

LJG4 functions in the Ku -dependent NHEJ pathway of DNA 

20 doujbie-strand break repair 

In S. cerevisiae, radiation-induced DNA double-strand 
breaks (DSBs) are repaired primarily by homologous 
recombination, which is mediated by genes in the RAD52 
epistasis group (Friedberg et al., 1995). Thus, disruption 

25 of RAD52 sensitises yeast cells to ionising radiation 

(Figure 3A) . However, eukaryotic cells can also repair DNA 
DSBs by a second pathway, non-homologous end- joining (NHEJ) , 
that utilises gene products distinct from those employed in 
homologous recombination. In both yeast and mammals, one of 

30 these components is the DNA-binding protein Ku, comprising 
subunits of -70 kDa and -80 kOa [Ku70 and Ku80 in mammals 
(Jackson and Jeggo, 1995); Yku70p/Hdflp and Yku80p/Hdf2p in 
yeast (Feldmann and Winnacker, 1993; Boulton and Jackson, 
1996a; 1996b; Feldmann et ai., 1996; Mages et al,, 1996; 

35 Milne et ai., 1996; Siede et ai., 1996; Tsukamoto et ai . , 
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1996)]. NHEJ appears to be the predominant pathway for DSB 
repair in mammals but represents a minor pathway in yeast; 
consequently, disruption of 5. cerevlslae YKU70 or YKU80 
only results in significantly increased sensitivity to 
5 ionising radiation or MMS when homologous recombination is 
inoperative (Boulton and Jackson, 1996a; 1996b; Milne et 
ai., 1996; Siede et ai., 1996). 

We have found that Iig4/rad52 double mutants are 
considerably more radiosensitive than are strains disrupted 
10 for RAD52 alone (Figure 3B) . As in the case for YKU70 

(mutant in the Ku70 subunit of yeast Ku) , disruption of LIG4 
hypersensitises yeast to ionising radiation in rad52 mutant 
backgrounds. Furthermore, Iig4/yku70/ rad52 triple mutants 
are not appreciably more radiosensitive than are yku70/rad52 
15 double mutant or Ilg4/rad52 double mutant strains, 

indicating that Lig4p and Yku70p function on the same RAD52- 
independent repair pathway. This provides indication that 
LIG4 is involved in the repair of ionising radiation-induced 
DNA damage and that it functions in a i?AD52-independent 
20 pathway. 

Since the effects of disrupting LIG4 are similar to 
those obtained by disrupting YKU70 or YKU80, we assessed the 
radiosensitivity of Iig4/yku70/ rad52 triple mutants. Such 
mutant strains are no more sensitive to ionising radiation 
25 than are the Iig4/rad52 or yku70/rad52 mutant strains 
(Figure 38) . 

Taken together, these data provided indication that Ku 
and LIG4 function in the same DNA repair pathway. 

30 Previous work has shown that Ku functions in DNA NHEJ 

and that this process can be measured through employing an 
in vivo plasmid repair assay (Boulton and Jackson, 1996a; 
1996b; Milne et ai . , 1996). In this assay, a yeast-S. coli 
shuttle plasmid pBTM116 (Boulton and Jackson, 1996a; 1996b) 



wo 98/30902 PCT/GB98/00095 

79 

is linearised by restriction enzyme digestion, then is 
introduced into S. cerevisiae by transformation. The 
plasmid contains the yeast selectable marker TRPl, the 3- 
lactamase gene, the ADHl promoter and £;coRI and Pstl 
5 restriction sites. Since the plasmid must be recircularised 
to be propagated, the number of yeast transformant colonies 
obtained quantifies the ability of the strain to repair the 
plasmid. Furthermore, since the DNA DSB generated in these 
studies resides in a region that is not homologous to the 

10 yeast genome, homologous recombination is suppressed and 
repair operates predominantly via NHEJ. 

We therefore analysed the ability of llg4 mutant yeasts 
to repair pBTM116 after cleavage with various restriction 
endonucleases . Wild-type, lig4, ykulO and Iig4/yku70 yeast 

15 strains were used. Cells for each strain were transformed 
in parallel with supercoiled pBTM116 or with an equivalent 
amount of pBTMllS that had been digested to completion with 
EcoRl (left panel of Figure 4) or Pstl (right panel of 
Figure 4). For each experiment, the value plotted is the 

20 number of transf ormants obtained with the linear plasmid 
expressed as a percentage of the number obtained for 
supercoiled DNA. Each experiment was repeated at least three 
times and, in each case, cells were plated and counted in 
duplicate . 

25 As with strains disrupted for YKU70 or YKUeO, lxg4 

mutant strains are severely impaired in plasmid NHEJ, and 
this is observed both with 5' or 3' overhanging DNA ends. 
Interestingly, these studies reveal that the effect of l±g4 
mutations is less pronounced with 3* overhanging DNA ends 

30 than it is with 5' overhanging ends. Although other 
alternatives exist, it is possible that this reflects 
differences in the mechanisms by which the two types of DNA 
ends can be repaired or is due to differential sensitivities 
of the different end structures to nuclease attack. Notably, 

35 DNA repair is not impaired further in ykulO/ lig4 double 



wo 98/30902 PCT/GB98/00095 

80 

mutant strains (Figure 4) . In addition, although the precise 
reason for this effect is not known, as is the case for 
yku70 or yku80 mutants, we have found that llg4 mutant 
yeasts have a slightly elevated ability to rejoin pBTM116 
5 bearing blunt-ends. 

Taken together, these results reveal that Lig4p plays a 
crucial role in the repair of plasmid molecules bearing 
cohesive DNA double-strand breaks in vivo. Secondly, they 
show that, although purified DNA ligase I {CDC9) has been 

10 shown to be capable of catalysing DSB joining in vitro 

{Tomkinson et ai., 1992), this enzyme does not play a major 
role in this pathway as assayed by ,in vivo plasmid DSB 
rejoining, and is unable to substitute efficiently for Lig4p 
in this process. Finally these results also show that Lig4p 

15 plays an important role in the Ku-dependent NHEJ pathway. 

Although plasmid repair is reduced dramatically in lig4 
mutant strains, it is not abolished. To determine precisely 
the types of DNA repair events that are dependent or 

20 independent of LIG4, repaired plasmids were recovered and 
then analysed by restriction enzyme digestion and DNA 
sequencing. In the absence of functional LIG4, the cohesive 
DNA termini are repaired by an inefficient error-prone DNA 
repair pathway. The plasmid pBTM116 contains the ADHl 

25 promoter and some repair products were generated by the gap- 
repair process involving the chromosomal ADHl gene. Of the 
large number of plasmids recovered from parental strains, 
all had been repaired by direct ligation of the cohesive DNA 
termini, thus regenerating the restriction enzyme cleavage 

30 site (Boulton and Jackson, 1996a; 1996b) . Plasmid repair 
products recovered from yku70 or lig4 mutant strains, 
however, were found to fall into several categories. Some of 
these corresponded to "gap repair" products which we have 
shown are generated via -RAD5 2- dependent homologous 

35 recombination with yeast genomic DNA (sequence analyses 
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reveal that homologous recombination is employed in the 
generation of these products and their production is 
abolished by disruption of RAD52; Boulton and Jackson, 
1996a; 1996b and data not shown) . This therefore provides 
5 further evidence that like YKU70 and YKU80, does not 

play a crucial role in homologous recombination processes. 
In ykulO or yku80 mutant strains, virtually all of the 
residual repair products were found to have suffered 
deletions (Boulton and Jackson, 1996a; 1996b) . In contrast, 

10 although many of the residual repair products generated in 
lig4 mutants had also suffered deletion of terminal 
sequences, some were rejoined accurately. 

Collectively, these results provide insights into the 
distinct roles performed by Lig4p and Ku in DNA NHEJ (see 

15 Discussion below) . 

Lig4p, unlike Ku, does not appear to function In telomere 
length maintenance 

Telomeres occur at the ends of eukaryotic chromosomes, 

20 are structurally distinct, and have unusual replication 

intermediates for which it is unclear whether a distinct DNA 
ligase is necessary (Blackburn, 1991; Zakian, 1995; Lundblad 
and Wright, 1996) . Recent work has demonstrated that Ku 
functions in telomere homeostasis, since disruption of 

25 either YKU70 or YKU80 results in a dramatic reduction in 

telomeric length (Boulton and Jackson, 1996b; Porter et aJ., 
1996) . 

Given that Lig4p and Ku function together in DNA NHEJ, 
we tested whether LIG4 is involved in telomere length 
30 control. 

To do this, yeast genomic DNA was digested with the 
restriction enzyme Xhol, which in wild-type strains produces 
a predominant telomeric fragment of -1.3 kb that is detected 
by Southern hybridisation to an oligonucleotide probe that 
35 binds to the repetitive telomeric sequences, including -400 
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bp of repeating iC-y_2^^ sequence and is detected by Southern 
hybridisation using a radiolabelled poly (GT)2o 
oligonucleotide. Notably, whereas disruption of YKU70 
results in telomeric shortening, loss of LIG4 function has 
5 no detectable effect. 

These data therefore reveal that, although Ku and Lig4p 
function together in DNA DSB repair, Ku but not Lig4p has an 
additional essential function in telomere length 
homeostasis . 

10 

MATERIALS AND METHODS 

Gene disruptions 

Full length LIG4 was amplified by PGR with primers 

15 LIG4~1 and LIG4-2 (5' TCAGTAGTTGACTACGGGAAAGTCT 3' and 5' 

ATGATATCAGCACTAGATTCTATAC 3', respectively) using the Expand 
High Fidelity DNA polymerase (Boehringer Mannheim) . After 
cloning into pGEM-T (Promega) , the resultant plasmid was 
digested with EcoRI, treated with Pfu DNA polymerase and 

20 then digested with Xbal . The HIS3 marker was inserted to 
replace the L1G4 ORF between residues 289 and 592. The 
disruption fragment was excised with Sphl and Spel and was 
used to transform the appropriate yeast strains to His"^. 
Gene disruption was verified by using LIG4 and HIS3 primers 

25 in PGR. Two RAD52 disruption constructs were provided by D. 
Weaver and have TRPl and URA3 selection respectively. 

Assessment of sensitivity to temperature and DNA damaging 
agents 

30 Aliquots (15 pi) of serial 5-fold dilutions of mid-log 

phase yeast cultures were spotted onto YPDA plates and were 
grown for 36 h at 30°C or 37 °G. Strains on one plate were 
exposed to 50 J/m^ ultraviolet (UV-G) radiation 
(Stratalinker ; Stratagene) . On another plate, YPDA medium 
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contained 0.0025% methyimethanesulf onate . In other studies, 
lig4 mutant strains did not display hypersensitivity to MMS 
(0.0005% and 0.005% in the growth medium) nor to UV-C (20 - 
150 J/m^) . 

5 

Ionising irradiation survival assays 

Three independent isolates of each strain were 
inoculated either into minimal media lacking the appropriate 
amino acid(s) or into YPDA and were grown overnight at 30°C. 
10 Cultures were diluted in sterile water to an ODgoonm value 
equivalent to 1 x lO'' cells/ml and 1 ml aliquots were 
irradiated using a ^•^'^Cs source at a dose of 0.18 kRad/min. 
I Irradiated samples and unirradiated controls were then 

diluted and plated in duplicate using an automated spiral 
1 15 plater (Whitley) on YPDA or minimal media. Colony numbers 
\ were ascertained following incubation at 30°C for 3-4 

days . 

; Plasmid repair assay 

\ 20 Plasmid repair assays were performed as described 

^ previously (Boulton and Jackson, 1996a; 1996b) . Briefly, the 

)/^ast- Escherichia coli shuttle plasmid pBTM116 (2-5 pg) , 
which contains TK9\ for selection in yeast, was digested 
with the appropriate restriction enzyme to completion and 

25 the enzyme was inactivated by treatment at 65°C for 20 min. 
Linearised DNA was then used to transform yeast by the 
lithium acetate method (Ausubel et al., 1987). Parallel 
transformations were performed with an equivalent amount of 
uncut plasmid to enable normalisation for differences in 

30 transformation efficiency. Diluted samples were plated in 
duplicate on minimal media lacking the appropriate amino 
acids, and colonies were counted following incubation at 
30°C for 3-4 days. To analyse plasmid repair products, DNA 
from single yeast transf ormants was isolated via the Yeast 

35 DNA Isolation kit (Stratagene) and this was used to 
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transform E. coli XLl-Blue cells (Stratagene) to ampicillin 
resistance. Plasmid DNA was then isolated and was analysed 
by restriction enzyme digestion and by DNA sequencing. 

5 Yeast DNA extraction and analyses of telomerlc DNA 

Genomic DNA from S. cerevisiae was isolated essentially 
as described (Ausubel et al., 1987). For telomere analyses, 
2 pg of genomic DNA was digested with 30 U of Xhol 
(Boehringer Mannheim) at 37°C overnight. The digested DNA 

10 was then separated on a 1.2 % agarose 1 x TAE gel and was 
transferred to Hybond Nfp+ membrane (Amersham) by capillary 
transfer in 20 x SSC as suggested b.y the manufacturer. 
Membranes were pre-hybridised in 0.5 M sodium phosphate, pH 
7.2, 1% SDS and then hybridised with 3 ng/ml of ^^P-end- 

15 labelled poly (GT)2o oligonucleotide (specific activity of 
>10^/pg) in a Church-based buffer (0.2 M sodium phosphate, 
pH 7.2, 1% BSA, 6% polyethylenegiycol 6000, 1% SDS) 
overnight at 62'*C, Finally, membranes were washed twice at 
room temperature for 30 min in 0.2 M sodium phosphate, 0.1% 

20 SDS, then exposed to pre-flashed X-ray film at -70°C. 

DISCUSSION 

The inventors' work with XRCC4 indicates that it is a 
predominantly nuclear protein. Moreover, through a variety 

25 of approaches, we have demonstrated that XRCC4 mediates 
extremely tight and specific interactions with DNA ligase 
IV. For example, these two components co-immunoprecipitate 
highly specifically with one another from HeLa cell 
extracts, even in the presence of 1 M NaCl . Furthermore, 

30 such interactions are not abrogated by ethidium bromide, 

indicating that the interaction between XRCC4 and ligase IV 
is not mediated by a DNA intermediate. Indeed, we show that 
bacterially expressed XRCC4 and ligase IV also bind to one 
another tightly, revealing that their interaction is direct. 

35 In addition, XRCC4 and ligase IV co-purify over every 
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chromatographic fractionation procedure we have employed, 
including gel-filtration in the presence of 1 M NaCl, anion 
and cation exchange chromatography, and hydrophobic 
interaction chromatography. Indeed, so far we have only 
5 resolved these two proteins by the addition of harsh ionic 
detergents , 

The fact that XRCC4 interacts tightly with ligase IV 
but not with the other DNA ligases that we have analysed has 
lead us to investigate the basis for this binding 

10 specificity. Notably, although all characterised mammalian 
DNA ligases contain a common highly related core catalytic 
region, each possesses unique N- and/or C-terminal 
extensions. We have found that it is the unique C-terminal 
domain and not the ligase catalytic region of ligase IV that 

15 interacts with XRCC4 . Interestingly, this region of ligase 
IV contains two tandem copies of the weakly conserved BRCT 
homology domain (Koonin et ai., 1996; Callebaut and Mornon, 
1997), leading one to speculate that it is one or both of 
these domains that mediate the interaction with XRCC4 . BRCT 

20 domains also exist in a variety of other factors and are 

required for those factors to interact (Mackey et al., 1997; 
Nash et ai., 1997), suggesting that the BRCT domain of one 
protein interacts with a BRCT domain of the other. Thus, in 
light of our work showing interaction between XRCC4 and DNA 

25 ligase IV it might be expected that XRCC4 would also possess 
one or more copies of the BRCT consensus . Although XRCC4 
has not been identified as a BRCT domain-containing protein 
by previous analyses, we have used manual and computer-aided 
inspections of the XRCC4 sequence to reveal limited 

30 homologies to other BRCT domains, suggesting that XRCC4 

might contain one or more divergent copies of this putative 
protein structural unit. 

Given that XRCC4 clearly functions in DNA NHEJ, our 
data indicate that DNA ligase IV also plays a crucial role 

35 in this process. XRCC4 may serve as a molecular bridge to 
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target ligase IV to DNA DSBs, perhaps through XRCC4 also 
interacting with other components of the DNA NHEJ apparatus 
(Figure 5) . In regard to such a putative bridging function 
for XRCC4, it is worthy of note that immunoprecipitation 
5 studies suggest that XRCC4 can interact with Ku and/or DNA- 

PK although these interactions only occur at low salt 

concentrations and hence are weak compared to those 
exhibited between XRCC4 and ligase IV. A possible physical 
linkage between XRCC4 and DNA-PK is attractive in light of 
10 the fact that we have shown that HeLa cell XRCC4 is a 

phospho-protein and is an effective substrate for DNA-PK in 
vitro. Since XRCC4 possesses a DNA-PK kinase consensus 
motif (Li et ai.,- 1995)- mutation of this S-itf^ may affect 
XRCC4 function in vivo. 

15 

Consistent with the proposal that ligase IV plays an 
important role in DNA DSB repair, we have identified a S. 
cerevisiae homologue of DNA ligase IV (displaying extensive 
sequence similarity along its length with mammalian DNA 

20 ligase IV) and have shown that inactivation of this factor 
debilitates DNA NHEJ in a manner that is epistatic with 
mutations in the yeast homologues of Ku70 and Ku80 (Boulton 
and Jackson, 1996; Boulton and Jackson, 1996; Teo and 
Jackson, 1997) . By contrast, we find that yeast ligase IV 

25 does not appear to play an essential role in the repair of 
ultraviolet light-induced DNA damage nor in the repair of 
DNA DSBs by homologous recombination (Teo and Jackson, 
1997) . Taken together with the data on XRCC4, this provides 
indication that ligase IV is dedicated to DNA NHEJ and that 

30 this function is conserved throughout the eukaryotic 
kingdom. 

The yeast gene, which we have designated LIG4, is not 
essential for DNA replication, RAD52-dependent homologous 
recombination nor the pathways of nucleotide excision repair 
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and base excision repair. Instead, we have shown that LIG4 
is specifically involved in the rejoining of DNA double- 
strand breaks by the process of DNA NHEJ, which does not 
demand homology between the two recombining DNA molecules 
5 and does not require RAD52. Notably, genetic epistasis 
experiments reveal that LIG4 acts in the same DNA repair 
pathway as Ku, a nuclear protein that specifically 
recognises DNA strand breaks. We have thus identified a 
novel 5. cerevisiae DNA ligase and have shown that it is 
10 involved specifically in the Ku-dependent NHEJ pathway of 
DNA DSB repair. 

In light of this, and given that mutations in YKU70 or 
YKU80 result in dramatic telomeric shortening in yeast 
(Boulton et ai., 1996b; Porter et ai,, 1996), we have also 
15 assessed the potential involvement of LIG4 in telomere 

length homeostasis. Telomeres are the protein-DNA structures 
at the ends of eukaryotic chromosomes that ensure the 
complete replication of chromosome ends, protect these ends 
from degradation, and prevent chromosomal termini from 
20 activating DNA damage signalling pathways or engaging in 
fusion and recombination reactions with other loci [for 
reviews (Blackburn, 1991; Zakian, 1995; Lundblad and Wright, 
1996) ] . In most organisms, telomeres are composed of 
variable numbers of simple repeat sequences and, at least in 
25 S, cerevisiae, the length of these sequence arrays is 
maintained by a combination of telomerase activity and 

dependent and -independent recombination. In yeast, 
deficiencies in Ku result in an approximately 70% reduction 
in the number of telomeric repeat sequences (Boulton et ai., 
30 1996b; Porter et ai., 1996). Given that Ku binds to the ends 
of double-strand DNA (Mimori and Hardin, 1986; Paillard and 
Strauss, 1991), one possibility is that Ku may interact 
directly with telomeric DNA ends and potentiate telomere 
lengthening by protecting telomeric DNA termini from 
35 nucleases or by augmenting telomerase recruitment. An 
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alternative explanation is that the effect of Ku 
inactivation on telomere length is indirect - perhaps the 
DNA repair defects that are associated with Ku deficient 
yeasts result in changes in cell physiology that impinge 
5 indirectly on telomere length control. Although it is not 
possible at present for us to identify precisely how Ku 
affects telomere length, the fact that mutations in LIG4 
have essentially the same DNA repair defect as Ku but do not 
alter telomere length argues for a specific role for Ku in 

10 telomere homeostasis that is distinct from its activities in 
DNA DSB repair. In this regard, it will be of interest to 
see whether mutated derivatives of Ku can be generated that 
have no effect on DNA repair but do result in defective 
telomeric maintenance. 

15 Yeast cells mutated in LIG4 have pronounced defects in 

DNA NHEJ, showing that Lig4p plays a crucial role in this 
process that cannot be complemented efficiently by yeast DNA 
ligase I. Conversely, yeast CDC9 and human DNA ligase I 
mutants are defective in DNA replication and, at least In 

20 vitro, this function is not performed efficiently by other 
enzymes. This indicates that yeast DNA ligases I and IV have 
distinct and largely separate cellular functions and cannot 
substitute effectively for one another. Thus, DNA ligase I 
plays a crucial role in DNA replication and also appears to 

25 seal single-strand DNA breaks that are the end-products of 
nucleotide- and base-excision repair, and moreover, is 
likely to complete recombination events between homologous 
duplex DNA molecules. There are also data suggesting that 
mammalian DNA ligase III is specialised towards particular 

30 functions. One splice variant (DNA ligase Ill-of) may operate 
in a separate pathway for base excision repair while another 
variant (DNA ligase III-p) has been implicated in meiotic 
recombination. Notably, there are no obvious homologues of 
mammalian DNA ligase II/ III in S. cerevlsiae. However, 

35 sequence analyses (Fig 1; Colinas et ai., 1990; Kerr et ai.. 
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1991; Husain et ai., 1995) reveal that these ligases are 
related more closely to DNA ligases encoded by cytoplasmic 
poxviruses than they are to DNA ligase I, suggesting that 
ligases II and III may have arisen fairly recently in 
5 vertebrate evolution. Interestingly, and largely consistent 
with the proposed functions for mammalian ligase III, 
inactivation of poxvirus DNA ligase does not affect viral 
DNA replication or recombination but renders the mutant 
virus more sensitive to DNA damage induced by UV or 

10 bleomycin (Colinas et al., 1990; Kerr et al., 1991). 
Collectively, these data suggest that DNA ligase I and 
perhaps DNA ligase II/III are involved predominantly in the 
rejoining of single-stranded nicks whereas DNA ligase IV is 
the major enzyme catalysing the joining of double-stranded 

15 breaks. 

In light of these points, and given that LIG^ functions 
in the highly evolutionarily conserved Ku-dependent NHEJ 
pathway, mammalian DNA ligase IV is indicated as having a 
key role in Ku-dependent DNA DSB rejoining. As is the case 
20 for Ku {reviewed in Jackson and Jeggo, 1995) , deficiency in 
mammalian ligase IV may result in cellular radiosensitivity 
and an inability to rejoin site-specific V(D)J recombination 
intermediates . 

Although the available data suggest diversification of 
25 function for the different eukaryotic DNA ligases, it is 
unclear whether this arises from intrinsic differences in 
catalytic activity or from differences conferred, for 
example, by the distinct C- and N-terminal extensions of the 
enzymes. At least in vitro, purified human DNA ligases I, 
30 III and IV show differing capacities to join single-stranded 
breaks in hybrid polynucleotide substrates (Arrand et ai., 
1986; Tomkinson et ai., 1991; Robins and Lindahl, 1996). 
Furthermore, purified mammalian DNA ligases differ in their 
abilities to rejoin DNA DSBs . It is noteworthy, however, 
35 that in contrast with the available in vivo data, these 
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studies show that purified ligase I but no other mammalian 
DNA ligase is able to catalyse the joining of blunt DNA ends 
effectively in vitro (Arrand et al., 1986; Tomkinson et al., 
1991; Tomkinson et al., 1992; Robins and Lindahl, 1996). One 
5 possible explanation for this discrepancy between the in 
vitro and in vivo data is that at least some of the 
eukaryotic DNA ligases may not have high intrinsic affinity 
for DNA and, within the cell, are targeted to appropriate 
DNA lesions by accessory factors. Consistent with this is 
10 the identification herein of strong interaction between DNA 
ligase IV and XRCC4 . 

Inactivation of either yeast Ku or Lig4p both result in 
a similar dramatic reduction of NHEJ in the in vivo plasmid 
DNA DSB repair assay. Because of this and since the level 
15 of DNA repair does not fall further in yeast strains 

defective in both Ku and Lig4p, we conclude that these two 
factors function in the same illegitimate recombination 
pathway. However, it is apparent that Ku and Lig4p have 
distinct functions in DNA NHEJ, as evidenced by the 
20 different spectra of residual plasmid repair products that 
are generated in the respective mutant strains. Thus, 
whereas nearly all the residual plasmid repair products 
arising in ykulO mutants suffer deletions, in lig4 mutant 
strains these correspond to a mixture of deletion products 
25 and products generated by accurate DNA end-joining. 

Collectively, these results suggest that Ku may function in 
at least two ways to potentiate DNA repair. Firstly, it may 
protect exposed DNA ends from nuclease attack. Secondly, it 
might serve to specifically recruit Lig4p, directly or 
30 indirectly, to the sites of DNA damage, perhaps via the 

Lig4p C-terminal extension that is absent from DNA ligase I. 
Consequently, the phenotypes of strains defective in Ku or 
Lig4p can both be explained to result from an inability to 
target a ligase to DNA DSBs efficiently. In Ku deficient 
35 strains, the ready access of nucleases to the DNA ends may 
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lead to deletions in virtually all the residual NHEJ repair 
products, which presumably arise via inefficient DNA end 
joining by untargeted ligase I or Lig4p. In contrast, when 
Lig4p is absent, Ku is still able to protect the DNA ends 
5 and this can explain how some accurate repair can still 
occur - this presumably being mediated by DNA ligase I, 
However, the reduced repair kinetics in lig4 mutant yeast 
may mean that, even in the presence of Ku, nucleases 
ultimately gain access to the DNA termini and lead deletions 
10 in a large proportion of the residual repair products. 

Consistent with the above model, we find that virtually all 
of the residual NHEJ products generated in yku70/lig4 double 
mutants have sustained terminal deletions. 

15 Interaction betwesn XRCC4 and Ku/DNA-PKcs complex 

XRCC4 interaction with DNA-PKcs/Ku was demonstrated by 
incubation of HeLa cell nuclear extract with anti-XRCC4 or 
pre-immune antiserum with purification of the resulting 
immuno complexes by adsoprtion onto protein A-Sepharose, then 

20 analysis by Western immunoblotting . 

Both DNA-PKcs and the two subunits of Ku are 
immunoprecipitated by the anti-XRCC4 antiserum but not the 
pre-immune serum. 

In these studies, immunocomplexes were washed under 

25 relatively mild conditions of 0.25 M NaCl and 0.1% Nonidet- 
P40. However, when more stringent washes were employed (for 
example, in the presence of 1 M NaCl, 0,1% Nonidet-P40 and 
50 pg/ml ethidium bromide) the interaction between XRCC4 and 
Ku/DNA-PKcs complex was abolished. 

30 Taken together, these data reveal that although the 

interaction between Ku/DNA-PKcs and XRCC4 appears specific, 
it is relatively weak. 

The interaction may be inhibited using appropriate 
agents, including peptide fragments of the respective 

35 proteins. Such agents may be identified and obtained using 
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assay methods and used in therapeutic and other contexts, as 
disclosed above. 
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